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THEORIES OF VISUAL ACUITY AND THEIR 
PHYSIOLOGICAL BASES! 
JOHN L. FALK 


University of Illinois 


THE PROBLEM OF VISUAL ACUITY 


Visual acuity is defined as the 
reciprocal of the minimum visible 
angle measured in minutes of arc. 
The assumption is that absolute size 
or distance of the test-object is not 
important, but only the angle it sub- 
tends at the eye. There are some 
indications in the literature that 
acuity may actually be a function of 
distance even when the visual angle 
remains constant, when the distances 
are less than one meter or so (13, 72). 
However, this is explicable in terms 
of the varying efficiency of accommo- 
dation. 

The most common acuity measures 
are those dealing with the minimum 
separable and the minimum visible. 
Minimum separable tests deal with 
the perception of a small gap between 
two parallel bars. Minimum visible 
tests present a single, fine line for 
discrimination upon a homogeneous 
background. Actually, no clear 
distinction can be drawn between 
these two categories, for as we en- 
large the parallel bars of the mini- 
mum separable test, we are at some 
point no longer dealing with this 
kind of test but with a minimum vis- 
ible test; the bars have become homo- 


' The author wishes to thank Dr. Peter 
Milner, McGill University, for helpful com- 
ments. 


geneous background and the gap the 
sole figure (the line to be discrimi- 
nated). Coincident with the above 
transformation will be an improve- 
ment in acuity or the ability to re- 
solve the stimulus figure. There are, 
in fact, a variety of ways for meas- 
uring visual acuity, more often than 
not yielding results which are incom- 
mensurable with one another. For 
example, acuity is maximal for a 
single line, it is somewhat less for 
stereoscopic and vernier methods of 
measurement, while the detection of 
the minimum separable for two 
bright bars is distinctly less well per- 
formed. 

The reader should be clear as to 
the difference between acuity and 
sensitivity. Sensitivity refers to the 
capacity of the organism to respond 
to small values of photic intensity, 
while by acuity is meant the capacity 
to distinguish (resolve) very fine or 
very close details. In general, the 
suthicient stimulus for the rod recep- 
tors in terms of radiant energy is 
lower than that associated with the 
cones, so that except for the red end 
of the spectrum, where sensitivity is 
about equal, rods are more sensitive 
than cones. Chapanis (13) refers to 
experiments which indicate that while 
a given level of dark adaptation al- 
lows light of a certain luminance to 
be perceived (sensitivity) it does not 


109 





110 


insure the ability to discriminate 
forms at this luminance _ level 
(acuity). Also, under scotopic condi- 
tions the retinal region of maximum 
sensitivity does not correspond to 
the region of maximum acuity. Later 
we shall consider certain structural 
features which contribute to acuity 
and sensitivity. 

Many factors have been shown to 
influence the degree of resolution: the 
luminance and size of the back- 
ground, surround luminance, wave- 
length, pupil-size, exposure time, 
retinal area stimulated, state of 
adaptation, etc. One is first inclined 
to attempt an explanation of acuity 
in terms of the projection of a geo- 
metrical image upon the retina. This 
interpretation must be rejected for 
two cardinal reasons. First, owing 
to its distortion the retinal image is 
far from being a pictorial replica of 
the visual world; and second, the 
retinal receptor mosaic, fine as it is 
in the central fovea, is not fine 
enough to account for the degree of 
visual resolution possible. However, 
as we shall presently indicate, a finer 
receptor ‘“‘grain’’ might not give rise 
to greater acuity at all. 

Let us these factors of 
image blurredness and receptor size 
in somewhat greater detail. The 
image is brought to focus on the 
light-sensitive retina by the cornea 
and the delicately adjusted accommo- 
dation of the lens. The pupil responds 
to the luminance of the field of view 
by widening or narrowing. At this 
point any analogy to a camera must 
be abandoned. There is a spread of 
light from the geometrical image of a 
bright object to its dark background 
(or into a dark object from a brighter 
background) owing to the diffraction 
of light by the pupillary aperture, 
as well as to the spherical and chro- 
matic aberrations. For a pupil size 
of 2.5 mm. or less, the spread of light 
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is attributable almost entirely to 
diffraction. For larger pupil sizes, 
the aberrations become important 
factors. Acuity has been found to 


increase up to a pupillary diameter 
of 2 mm., remain constant from 2 to 4 
mm., and then decrease owing to 
chromatic aberration. 

There are still further respects in 
which the eve is quite unlike a cam- 


era. It is not light-tight, and allows 
light to pass through the sclerotic 
coat surrounding the cornea. There 
is a further diffraction and scattering 
of light in the internal media of the 
eye. Light must pass through retinal 
blood vessels, nerve fibers, and cell 
bodies in order to reach the rods and 
cones. And there is reflection from 
the formed image on the retina to all 
other parts of the retina. Bartley 
(6, p. 58) claims that under many 
“the level of stray illu- 
mination is a considerable fraction of 
the intensity of the image itself.” 
Further, ‘“‘with a single disk [as 
image], the major factor producing 
the electroretinogram is fluctuation 
in stray light. When stray light is 
constant, the conditions for produc- 
ing a series of waves in the electro- 
retinogram are absent” (7, p. 926). 
The photographic plate has an 
even distribution of light-sensitive 
substance upon its surface, while the 
density of rods and cones varies ac- 
cording to retinal region. There are 
not only regional variations in the 
type, size, and spacing of the retinal 
receptors, but also the initial retinal 
“orain’’ is substantially altered by 
various synaptic relationships (e.g., 
the funneling of a group of rods onto 
one ganglion cell). Again, the image, 
the points of which are composed of 
small ‘“‘blur circles,’ is focused ex- 
actly only for a small central region, 
the more peripheral areas of the 
image being somewhat out of focus 
(17). For cone vision it has been 
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found that light passing into the eye 
obliquely at the edge of the pupil is 
a less effective stimulus than light 
entering through the center of the 
pupil. This directional sensitivity is 
termed the Stiles-Crawford effect 
and has been explained (57, 58) by 
considerations of shape, re- 
fractive index, and locus of photo- 
sensitive pigments. The adaptive 
range of the eye is over 1,000,000 to 
1, but pupil area varies only through 
a 16-to-1 range. Clearly, adaptation 
must be explained by some additional 
Also, the Stiles-Crawford 
effect has shown that even this pupil 
range has less control over alterations 
in retinal illuminance than was for- 
merly assumed (54, p. 403). 

To this complex picture of illumi- 
nance distribution we may add the 
fact that the eves are in a state of 
constant fluttering motion termed 
physiological nystagmus. This fine, 
rapid tremor is to be distinguished 
from the larger saccadic “flicks” and 
‘drifts’ (18). Physiological 
nystagmus occurs since the eveball is 
held in balance between pairs of an- 
tagonistic muscles, and unlike the 
larger movements, it is not binocu- 
larly coordinated. The measurements 
of tremor amplitude do not agree too 
well among the various experiment- 
ers, but the effect itself is well estab- 
lished. 

Thus, despite accommodation of 
the lens for maximum acuity, we 
find that the image delivered to the 
retina is blurred and distorted by 
many factors: diffraction at the pu- 
pil, spherical and chromatic aberra- 
tion, passage of extraneous light 
through the sclerotic coat, diffraction 
and scattering of light in the internal 
media, the imposition of blood vessels 
and neural tissue between incoming 
light and the receptors, reflection 
from the formed image, differential 
densities of receptor elements, lack 


cone 


factors. 
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of sharp focus except at the image 
center, the Stiles-Crawford effect, 
and physiological nystagmus.” 

We must now consider the second 
factor arguing against the interpre- 
tation of acuity by the geometry of 
images. This is the set of structural 
characteristics of the retinal mosaic 
in the light of acuity performances. 
According to the geometrical inter- 
pretation, for resolution of the details 
of a pattern to occur, the image 
formed on the retina must not havea 
fineness of grain exceeding that ex- 
hibited by the diameter of the individ- 
ual retinal receptors. But Hecht 
and Mintz (41) found that a thin, 
dark line subtending 0.5” of visual 
are could be resolved. This is roughly 
equivalent to 1 16-inch 
wire at a distance of half a mile. By 
the geometry of the situation, the 
image of this line (uncorrected for 
diffraction, etc.) would cover only 
about 1/40 of the diameter of the 
smallest foveal cones. The smallest 
cones in the fovea centralis are 1 to 
1.5u in diameter and therefore sub- 
tend a visual are of about 12” to 18” 
which is considerably larger than the 
0.5” minimum visible found by Hecht 
and Mintz. How is it possible for the 
eve to discriminate details finer than 
One reason is that 
the 0.5” geometrical image is a fic- 
tion; the sharpest edge will spread its 
blurred image over at least four or 
five foveal Also, it has 
been suggested that within certain 


seeing a 


its own mosaic? 


receptors. 


2 Some additional factors producing dis- 
tortion of the retinal image are given by 
T'schermak-Seysenegg (76, pp. 7-10). Lest the 
amount and complexities of retinal image dis- 
tortion appear too great for fine resolution 
ever to occur, it should be noted that the eve 
possesses certain mechanisms which help in 
correcting the aberrations: the differential 
curvature and density of the lens compensates 
for spherical aberration, while its yellow color 
reduces chromatic aberration by eliminating 
the ultraviolet end of the spectrum. 
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limits, rate of firing may be ‘“‘trans- 
posed ...into an ‘angular value,’ 
that is, interpreted dimensionally: 
low intensities as small, high inten- 
sities as large, angular sizes’ (64, 
p. 429). 

Summarizing, we may say that 
there are at least two problem para- 
doxes in the explanation of visual 
resolution. The image formed upon 
the retinal mosaic appears to be a 
rapidly vibrated blur. The mosaic 
itself, tight-packed with fine cells 
in the foveal region, is still too coarse 
geometrically to account for the 
fineness of visual acuity. However, 
in the conjunction of these two para- 
doxes investigators have seen various 
ways of solving both of them. These 
lines of thought will be discussed 
under the somewhat arbitrary head- 
ings of ‘“‘static theories’ and ‘‘dy- 


namic theories.” 
There are certain major factors in 
the problem of visual acuity which 


should be kept in mind, since it is 
interesting to see how the various 
theorists handle them: (a) the blurred 
character of the retinal image, (6) 
the continuous motion of the image 
due to tremor, (c) the diameter and 
spacing of the visual cells, (d) the 
modes of connection of the visual 
cells to the optic nerve, and the way 
in which the rest of the visual system 
is assumed to affect acuity. 


THE SIZE AND DISTRIBUTION 
OF RETINAL RECEPTORS 


This section is intended to give the 
reader the barest outline of anatomi- 
cal features in the retina required to 
comprehend the theories to be dis- 
cussed. Other material of this nature 
will be discussed in the context of 
the theory utilizing it. 

The structural limitations of visual 
resolution, aside from the blurring of 
the image, begin with the size and 
separation of the receptor elements 
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in the retina. Since rods are absent 
from the fovea centralis and are not 
implicated in photopic acuity they 
will be ignored in preference to the 
cones in this discussion. 

The thinnest cones are present in the very 
center of the fovea, where their thickness, in 
Man, is reduced to almost 1, corresponding 
to a visual angle of approximately 12” to 15” 
of arc.... The central territory, where the 
cones are almost uniformly thick, measures 
approximately 100u across, corresponding to 
oe a. ae It contains approxi- 
mately 50 cones in a line. This area seems to 
be not exactly circular but elliptical, with the 
long axis horizontal, and may contain alto- 
gether 2000 cones... here practically every 
cone is individually linked with the ganglion 
cells by means of its own ‘‘private’’ midget bi- 
polar... it represents an independent func- 
tional unit... the size of each of the 2000 
receptor-conductor units measures, on the 
average, 24” of arc (64, pp. 425-26). 


At the edge of the fovea, the num- 
ber of cones to 100 (20’ of arc) is 
reduced to 30, and their size increases 
to 40” of arc. This trend continues 
as we move further and further into 
the periphery, the cones becoming 
larger and more sparse. 

The cones of the fovea are mostly 
hexagonal, and ‘“*...in certain di- 
rections there are rows arranged in 
straight or nearly straight lines; in 
other directions this arrangement is 
less pronounced or is altogether ab- 
sent. But even where regular, the 
rows are straight on short stretches 
only, soon becoming a tangle of rows 
and groups of cones oriented differ- 
ently” (64, p. 427). 

The cones are separated by thin 
partitions of neuroglia, the thick- 
ness of which is less than the diame- 
ter of the thinnest cone. These parti- 
tions vary from }3u or 3 in the very 
center to }u (6” of visual arc) in the 
periphery. 

The fovea. itself consists of a flat, 
shallow pit in the retinal surface in 
primates. The particular visual pur- 
pose served by this arrangement is a 
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subject of debate among experts. For 
two divergent views see Polyak (64, 
pp. 209-210) and Walls (77, pp. 183- 
184). 

As might be suspected, the visual 
icuity of particular retinal areas is 
losely associated with the cone den- 
sity there (see 64, Fig. 99). Recent 
evidence (44, 57) indicates that our 
highest acuity is confined to a central 
region 354 in diameter (7’ of arc). 
Just beyond this tiny region (26 
cones across, containing a total of 
approximately 600 cones) there is a 
5 per cent decrement is acuity. 


STATIC THEORIES 


Intensity Discrimination 
(LTariridge) 


Theory 


Hartridge (35) presented details of 
his theory of visual acuity in 1922 


ra or 


and apparently still holds to the 
main outlines of it, although admit- 
ting that it Was wrong in its quantita- 


tive aspects (36). The point of de- 
parture consists in considering the 
acuity task to be a particular case of 
intensity discrimination. The blurred 
retinal image is regarded as a distri- 
bution of intensities. With a mini- 
mum visible bright line as test-object, 
the cone at the center of the image 
will receive light of greater intensity 
than those cones on either side of it. 
This threshold intensity difference 
was calculated by means of Ray- 
leigh’s equations, which take account 
of the diffraction and aberration char- 
acteristics of light, and was found to 
approximate 10 per cent. Thus, the 
eye actually is considered to turn the 
blurredness of the retinal image to its 
advantage. In contrast to the geo- 
metrical interpretation of visual acu- 
ity, Hartridge’s theory posits that 
“.. the diffraction pattern of an 
object formed on the retina must not 
exhibit differences of intensity which 
are below the threshold of difference 
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perception of the retinal receptors” 
(36). As shown in Figure 1, the only 
requirement for resolution of the 
minimum separable gap is that the 
illuminance incident at the central 
cone row be perceptibly different 
from that received by rows on either 
side. 


Intensity Discrimination 


(Hecht) 


Hecht with the fact that 
minimum separable acuity varies 
with the intensity of illumination, 
and utilizing the analogy of the pho- 
tographic plate, states that the fine- 
ness of detail which the retina can 
register is dependent upon the density 
of active receiving elements. By 
combining this conception of the 
visual system with the assumption 
that the sensitivity of receptors is 
randomly distributed,* Hecht (39) is 
able to give an account of the increase 
in acuity with increased field lumi- 
nance. The integral of the Gaussian 
distribution curve is an S-shaped 
function which is fitted to Kénig’s 
intensity-acuity data. According to 
Hecht (38, 39) an increase in photic 
intensity will give rise to an increased 
number of retinal elements which are 
functional, théreby effecting a reduc- 
tion in the average center-to-center 
distance of active elements and in- 
creasing the resolving power of the 
surface. Visual acuity, then, “varies 
directly with the number of func- 
tional rods or cones in a unit area of 
illuminated retina’ (38). 

Like Hartridge, Hecht considers 
the image on the retina as a pattern 
of intensities. If one row of cones is 
subjected to an intensity difference 
amounting to only 1 per cent with 
respect to adjacent rows, then this 
difference will be perceived. Since 


Theory 


starts 


3 


lhis is assumed for both rod and cone 
populations, although the rods are regarded as 
having lower thresholds in general. 
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only one cone row is differentially 
affected, the line (in the case of the 
minimum visible) will be sharp 
rather than fuzzy* (41). Also the 
intensity difference between the line 
and the background retinal illumi- 
nance is imposed on different ‘‘center 
rows” of cones in a short time interval 
owing to eye movements. 

Finally, Hecht and Mintz (41) 
note that an intensity difference of 1 
per cent is near the limit of differen- 
tial intensity discrimination. They 
suggest that the just resolvable visual 
angle varies with light intensity in 
the same fashion as the capacity to 
discriminate intensity differences var- 
ies with intensity. 

Critique of Hecht's theory. In view 
of the wide influence which this 
theory has exerted upon current con- 
ceptions of visual acuity, it may be 

* From what we have already noted above 
concerning Polyak's denial of the existence of 
straight “cone rows” it would seem necessary 
to explain the sharpness of fine resolution upon 
some other basis. 
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well to indicate some of the evidence 
which fails to support it. 

Byram (12) has pointed out that 
the equations which Rayleigh de- 
rived for the distribution of light in- 
tensity on the retina in the case of a 
long, black line on a uniformly bright 
field are valid only for a rectangular 
aperture. Since the pupil is circular, 
the calculations of Hecht and Mintz 
err by 15 per cent. Hartridge’s error 
amounts to some 200 per cent. 

Lythgoe (49) has shown that the 
classical, sigmoid intensity-acuity 
curve obtained by KGnig errs at both 
the high and low ends owing to a 
failure to maintain a surround lumi- 
nance comparable to that of the test- 
object background. Lythgoe was 
able to reproduce Konig’s results at 
the high end (a leveling off or slight 
falling off of acuity at high task 


5 The reader who desires additional critical 
material on Hecht’s theory should consult the 
excellent review articles by Senders (70) and 


Walls (78). 
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luminances) with inadequate — sur- 
round luminance; but when adequate 
surround luminance was maintained, 
visual acuity continued to increase 
with luminance of the 
acuity task. Thus, any theory which 
views visual acuity as 
solely in terms of brightness dis- 
crimination (e.g. Hecht’s) is faced 
with the tollowing problem: the dif- 


increasing 


explicable 


ferential brightness threshold ceases 
to improve at high brightnesses (and 
perhaps deteriorates) while acuity 
continues to increase up to the high- 
est luminance value used by any ex- 
perimenter! Moon (54) describes a 
study by Eguchi in which acuity con- 
tinued to rise up to values of lumi- 
nosity twice as great as that of white 
paper in direct noon sunlight. Moon 
comments further that . neither 
Lythgoe nor Eguchi succeeded, at 
the highest values of [!luminosity], in 
illuminating the surround to the 
luminosity of the test-object back- 
ground. Thus we have no proof that 
the straight-line 
continue upward 


relation maw not 


considerably —be- 


yond the point at which even the 


Eguchi tests indicate a bend” (54, 
p. 435). 

Hecht’s theory apples to Konig’ 
curve which is flat-topped and does 
not correctly represent t! 
acuity relation under more ade- 
quately controlled conditions. Like- 
wise, at the low-intensity end of the 
curve Lythgoe (49) obtained no flat- 
tening-out, a result which indicates 
that the sharp discontinuity of the 
rod and cone limbs of the curve may 
not actually occur. 

The assumption of a random dis- 
tribution of thresholds for 


le intensity 


retinal 


elements has been called into ques- 
tion by many investigators. 
have been willing to accept the view 
that cone thresholds vary so widely. 
In the light of the results of Lythgoe 
and Eguchi one would be forced to 


Few 


VISUAL 


ACUITY 115 
conceive of the retina as a sensitive 
surface in which the thresholds of 
some of its elements have never been 
reached. Fine as the retinal grain is, 
the theorist is hard put to explain 
the greater fineness of acuity  per- 
formances, let alone ruling out a 
considerable part of the mosaic for 
having thresholds so high that they 
would seldom (if ever) be attained. 
Hecht has been criticized for all but 
disregarding an important dimension 
of neural action, namely, that the 


frequency of neural discharge varies 


with stimulus intensity. Although 
there is undoubtedly some variation 
among thresholds, the most 
likely candidate for the representa- 


tion of intensity level is the relative 


cone 


discharge rate passed on to optic 
nerve fibers rather than changes in 
the absolute number of receptors ex- 
cited (3, 33). The receptor responds 
in a graded rather than in the all-or- 
none fashion Hecht assumes. 
Actually, the assumption of widely 
separated thresholds in percipient- 
element populations gives rise to 
rather peculiar implications. 
For example, consider the discrimina- 
tion of a small, dark spot on a moder- 
ately illuminated background. The 
dark area will produce an effect of 
illuminance decrement on a number 


some 


of cones and the dark spot will be per- 
ceived. But according to Hecht's 
theory there must be many such areas 
under moderate illumination condi- 
tions since the thresholds of many of 
the cones have not yet been reached. 
Phe homogeneous background would 
then appear as a_ rather spotty, 
blotchy expanse in which the retinal 
representation of the true dark spot 
would be indistinguishable from small 
clusters of elements with higher 
thresholds. Needless to say, such 
circumstances do not exist. 

The Hecht theory is unifactorial— 
only intensity is assumed to affect 
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acuity. Other factors influencing 
acuity performances have been neg- 
lected (e.g., size and shape of the 
test-object). The fact that increasing 
luminance gives rise to only a slight 
change in vernier acuity (aligning 
power) presents a further diff: culty 
for the theory (1, 9). Both Berger (8) 
and Ogle (61) note that with two 
point sources on a dark background 
the minimum angle of resolution in- 
creases (resolving power decreases) 
as the luminance of the points is in- 
creased. Ogle (61) has explored this 
effect using various background lumi- 
nances and concludes: “‘. . . it would 
appear that Hecht’s theory for the in- 
crease of visual acuity with illumina- 
tion does not apply, because the 
MAR [minimum angle of resolution] 
was apparently a function of lumi- 
nance ratio or contrast only.’’® 
Finally, O’Brien and O’Brien (60) 
have subjected Hecht’s assumption of 
large, fixed threshold differences 
among receptor elements to a fairly 
direct test. Hecht’s theory requires 
a variation among cone thresholds 
greater than one thousandfold. These 
investigators, using a double star 
test-object, contrived to concentrate 
approximately three-quarters of the 
luminous flux of the star image upon 
only three foveal cones. Consider- 
ing diffraction, etc., this degree of 
concentration is probably close to 
the maximum possible for acuity 
tests. At surround luminance from 
.01 to 100 foot-lamberts, “the star 


6 Parenthetically, it is worth mentioning 
that visual acuity is a complex function of a 
host of conditions. We choose to view movies 
in the dark where our acuity is poorer only 
because the contrast conditions are so much 
better (48, p. 260). But on the other hand, 
Ogle (61) has shown that visual acuity de- 
teriorates with increased point luminance on a 
constant background (increased contrast) and 
improves with increased luminance of back- 
ground when the point luminances are con- 
stant (decreased contrast). 
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illumination necessary for visibility 
of both stars was never more than 
twice the illumination at = which 
neither was visible. It is concluded 
that the small variation among foveal 
cones here observed is inconsistent 
with the Hecht theory.” 


DYNAMIC THEORIES 


The ‘‘dynamic’’ theories of visual 
acuity differ from the “‘static’’ theo- 
ries in One important respect: while 
Hartridge and Hecht give scant at- 
tention to eye movements, the physi- 
ological nystagmus (continuous, fine 
ocular tremor) is given an important 
role in augmenting resolution per- 
formances by certain theorists. 


The Theory of Weymouth et al. 


This theory is an attempt to ex- 
plain how vernier acuity operates 
with such remarkable sharpness and 
accuracy. Hair lines appear straight 
and clear despite the irregular orien- 
tation of “cone rows,’ and vernier 
acuity is accurate to 2”, or somewhat 
less than one-tenth of the width of a 
central foveal cone. Weymouth and 
his co-workers (4, 82) make use of two 
assumptions which were held by I !er- 
ing: first, that each cone has a local 
sign, and second, that physiological 
nystagmus contributes to acuity. 
Actually, the second of these is not 
an assumption in the strict sense for 
these investigators, since it is derived 
from some more basic aspects of the 
situation at the retina. If two adja- 
cent cones are stimulated they are 
assumed mutually to affect each 
other’s local sign, the net effect being 
to shift the pooled local signs to some 
intermediate position. Thus, nystag- 
mus would give rise to a process of 
successive averaging, the product of 
which is termed “retinal mean local 
sign.” 

The following, then, is a picture of 
vernier resolution as it would be ac- 
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counted for by this theory. Any cone 
“touched” by light is assumed to be 
stimulated (4). Consider an edge be- 
tween the light and dark portions of a 
visual field. The cones which fall 
along the retinal image of such an 
edge would form a staggered and ir- 
regularly spaced line, as photomicro- 
graphs reveal. The mutual influence 
of the local signs of such cones will 
serve to straighten this edge some- 
what, and the horizontal nystagmus 
movements sweeping this diffraction 
edge over a wider strip of cones will 
add to this dynamic averaging proc- 
Since equal numbers of stimu- 
lated cones will come to lie in all 
parts of the strip swept out by the 
diffraction edge under the control of 
nystagmus, the center of this strip 
will represent the “center of gravity” 
of all the stimulated cones. 


ess. 


The average or mean of these points, which 
determines the local sign of the straight line, 
is therefore not restricted to such units as inter- 
conal distance or cone diameter but may be ac- 
curate to a small fraction of these units just as 
the mean of a number of measurements made 
in inches may be accurate to a small fraction 
of an inch (4, italics mine). 


kor the perception of the vernier 
offset to attain maximum accuracy 
the lines must be long enough so that 
a sufficiently large sample of cones 
may be averaged. It is a fact that 
increasing the line length up to a cer- 
tain limit does aid vernier acuity. 
According to this view, acuity is rela- 
tively independent of the size and 
spacing of the retinal elements, the 
dynamic averaging process func- 
tionally producing a finer “grained” 
mosaic for line representation and 
alignment. 


The Theory of Marshall and Talbot 


The aim of these authors is to de- 
velop a theory of visual acuity based 
upon factors of the nervous system 
which have presented embarrassing 
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complications to past theories rather 
than helpful mechanisms. This is the 
only current acuity theory which in- 
volves the anatomy oi the visual pro- 
jection system up to Brodmann’s 
Area 17, as well as many neurophysi- 
ological phenomena ignored by pre- 
vious theories. Just as the blurring 
of the retinal image by diffraction, 
aberrations, etc. was utilized by the 
‘static’ theories to explain the ex- 
treme fineness of acuity, Marshall 
and Talbot (53) carry the process on 
several steps by taking account of 
various neurophysiological phenom- 
ena such as facilitation, summation, 
the neural recovery cycle, overlap 
and multiplication of pathways. 
First, let us consider two anatomi- 
cal factors of great importance to the 
theoretical presentation: reciprocal 
overlap and multiplication of path- 
way. The principle of reciprocal over- 
lap is easily grasped by relating the 
diagrammatic representation of Fig- 
ure 2 to the following description: 
‘“‘In the cat, optic tract endings in the 
geniculate divide’ into — several 
branches and as many as 40 ring- 
shaped boutons have been seen on 
single radiation cells which may come 
from as many as 10 optic tract fibers” 
(53). Multiplication of pathway 
simply refers to the fact that in the 
monkey, for example, ‘‘the optic tract 
fibers seem to divide into 5 or 6 
branches, and each branch ends in a 
bouton which makes contact with a 
different radiation cell, resulting in 
multiplication of the transmitting 
path” (53). Such multiplication, as 
the visual system ascends to the cor- 
tex, provides a probability-distribu- 
tion of cortical cells for each foveal 
cone. Each receptor, then, may acti- 
vate different paths to different corti- 
cal cells at different times. This ex- 
pansion or refinement of ‘‘grain”’ 
from retina to cortex is independent 
of the functional and anatomical 
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spread due to reciprocal overlap. 
While it is admitted that overlap is 
minimal for neurons ascending from 
the fovea centralis and is more char- 
acteristic of the periphery, it is never- 
theless maintained on both anatomi- 
cal (64, p. 430) and experimental (30) 
grounds that possibilities for such 
interaction exist. 

As the retinal receptors are vi- 
brated back and forth across the con- 
tours of the diffraction image of a 
test-object by the small-amplitude, 
high-frequency action of physiologi- 
cal nystagmus, they are subjected to 
rapidly changing illuminance values. 
Not only the amplitude of the illumi- 
nance change but also its rate of 
change will determine the magnitude 
of receptor outputs. The experiments 
of Hartline and Graham (33) on the 
photoreceptors of Limulus have 
shown rate of stimulus onset as well 
as intensity to be an important de- 
terminant of high discharge fre- 
quency. Further, this photoreceptor 
initially discharges at a high fre- 
quency when stimulated but rapidly 
adapts to a lower discharge rate de- 
spite the constancy of the stimulus. 
Ocular tremor, then, would serve to 
maintain a high neural discharge rate 
by imposing sudden illuminance 


changes to many receptor ‘‘rows”’ 
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over frequently occurring cycles. 
Nystagmus is assumed to fluctuate 
“with approximately a statistical oc- 
currence of various amplitudes” (53). 
Thus the small illuminance difference 
presented by a hair-line test-object 
might be made liminal by the rapid- 
ity of its onset over sets of receptors 
which are stimulated intermittently, 
thus by-passing the effects of neural 
accommodation. As Marshall and 
Talbot (53) remark: ‘This essential 
discontinuity of stimulation is neces- 
sary for continuous vision through a 
fatiguable system.”’ 

We are now in a position to illus- 
trate how the above factors, in com- 
bination with the excitability changes 
of the neural recovery cycle, might 
work to produce certain gradients of 
excitation which progres- 
sively refined and sharpened as they 
are passed on to higher synaptic 
levels. Confining our discussion to 
the retinal level for the present, we 
shall three whose 
salient features and modes of opera- 
tion have been described above: (a) 
the intensity distribution (cf. diffrac- 


become 


consider factors 


tion, etc.) of the retinal image, 
termed the diffraction image, (>) 
physiological nystagmus, and (c) 


rate of illuminance change. 
In Figure 3 the intensity distribu- 
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Fic. 3. THe DIsrrRiBUTION OF 


ILLUMINANCE 


ON THE RETINA AcROSS THE GEOMETRICAL 


BounbDARY (MN) SEPARATING LIGHT AND DARK HALVES OF A FIELD (MODIFIED FROM JONES 


AND HIGGINS 


tion of the diffraction image of a bor- 
der between the light and dark sides 
of a bipartite field is shown (curve 
A). The geometrical border is MN. 
This figure is based upon one given by 
Jones and Higgins (44) who apply 
some of the conceptions of Marshall 
and Talbot in their treatment 
photographic granularity and graini- 
ness. B indicates a_ hypothetical 
foveal cone distribution (a center-to- 


ot 


center distance of 1.54 is assumed). 
The distance between X and Y repre- 
the best estimate 
below) of the average amplitude of 
X and Y 
mark the limits of tremor for cone a. 
Thus, the diffraction image of the 
border MN is’ spread 
over the cones with the graded illumi- 
nance of curve A. Tremor XY will, 
by shifting the cones with respect to 
the diffraction image, subject them 


sents recent (see 


physiological nystagmus. 


geometrical 


to different rates of change of illumi- 
lor example, if the bank of 
cones shown in Figure 3 moves to the 
right from the position shown, cone 
a will the greatest rate of 
change of illuminance since it lies on 
the steepest portion of curve A. 
b, transecting a less steep portion of 
the illuminance distribution, will be 


nance. 


recel ve 


Cone 


subjected to a less rapid stimulus on- 
and produce a 
somewhat lower neural discharge fre- 
quency. From an inspection of Fig- 
ure 3, it can be seen that the middle 
—b) will receive the 
greatest over-all illuminance changes, 
with cone a, on the average, receiving 
the largest rate of illuminance change. 
Considering the complete cone bank 
B, the differences in both amplitude 
and rate of illuminance change aris- 
ing from ocular tremor will create a 
gradient of discharge frequency over 
the receptors which is peaked in the 
center. 

At the retinal level, then, the Mar- 
shall-Talbot theory provides one 
mechanism for the neural peaking of 
borders. Betore passing on to higher 
synaptic levels, it should be noted 
that, unlike the Weymouth theory, 
receptor size plays an important role 
here. Cone size is one factor deter- 
mining the rate of illuminance 
As Marshall and Talbot (53) 
put it: “Smaller receptors would be 
though traversing 
the optical gradient oftener, they 
would gather proportionally — less 
brightness differential. The limiting 
retinal factor in acuity seems to be 


set, consequently 


cones (d, b, 


change. 


useless, because 








120 


the relation of receptor width to the 
highest optical gradient in a moving 
pattern, rather than the average 
static differential illumination on one 
cone, compared with its neighbors.” 

The diffraction image, peaked at 
the retinal level by differential mag- 
nitudes and rates of illuminance 
change owing to the nature of the 
illuminance distribution and nystag- 
mus motion, is passed on to the lat- 
eral geniculate. By referring to Fig- 
ure 2, it can be seen that multiplica- 
tion of pathway and reciprocal over- 
lap will provide further peaking for 
submaximal reactions. If the system 
is not saturated the greatest reac- 
tion density will occur at the center 
of the reacting group, for it is here 
that both spatial and temporal sum- 
mation will be greatest owing to the 
peaked character of the in-coming 
discharge frequency pattern. Path- 
way multiplication and overlap serve 
to refine the mosaic ‘‘in proportion to 
the sharper gradients and peaks pro- 
duced, as sand forms sharper peaks 
than bricks” (53). 

To complete the picture, we must 
consider yet another neurophysio- 
logical mechanism: the neural re- 
covery cycle. Observations on the re- 
covery cycle at the lateral geniculate 
of the cat show that the second of two 
stimuli of weak or moderate intensity 
‘produces an enhanced postsynaptic 
spike, during the first 10 to 30 
msec.... This mixture of super- 
normality, facilitation, recruitment, 
or summation is succeeded by a 
longer period of depression or sub- 
normality” (53). The recovery cycle 
functions “to break up neural ac- 
tivity into temporally discontinuous 
transmission,” and the summation 
possible during the supernormal pe- 
riod will serve to amplify the gradients 
oscillating at the lateral geniculate 
level. Utilizing the observations of 
Adler and Fliegelman (2) on physio- 
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logical nystagmus, Marshall and 
Talbot (53) state that the periods 
and amplitudes of this action ‘yield 
velocities of transit across the mosaic 
which stimulate the receptors dis- 
continuously, at intervals important 
in the neural recovery cycle.” 

To clarify the manner in which the 
recovery cycle at the lateral genicu- 
late might serve to emphasize inten- 
sity differentials at borders, consider 
what might possibly occur as cone 
bank B (Fig. 4) oscillates back and 
forth. With the bank at the extreme 
right position (cone a at Y), cones 0, 
and ¢ have just recently given on- 
responses while cones —b and —c 
have responded somewhat earlier. 
As bank B begins to shift to the left 
on the second half of the cycle, the 
geniculate correlates of cones }, and 
c will be responding to the onset of 
illuminance on cones 6 and ¢ in the 
supernormal portion of their recovery 
cycle. Therefore, early in the second 
half-cycle of nystagmus, the genicu- 
late correlates will show summation 
(peaking) according to the Marshall- 
Talbot scheme. By the time cones 
—b and —c reach the steep portion 
of A their geniculate correlates will be 
in the subnormal phase so that the 
activity arising from these cones un- 
dergoes suppression at the genicu- 
late. On other cycles, the geniculate 
correlates of —6 and —c will be 
amplified while & and c¢ are sup- 
pressed. The correlate of a will tend 
to be somewhat suppressed com- 
pared to (b, c) and (—b, —c) since 
recruitment at the geniculate is in- 
versely proportional to the strength 
of the conditioning stimulus, and a 
receives, on the average, a stronger 
stimulus on its conditioning half- 
cycle than b, c, —b, or —c. This 
means that the gradients of activity 
at the borders of bar C will be sharp- 
ened and acuity enhanced. 

It has already been indicated how 
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Fic. 4. THE DISTRIBUTION OF ILLUMINANCI 
ON A DARK FIELD. 


the mechanism of pathway multi- 
plication and overlap in combination 
with a fluttering neural ‘‘image”’ 
would tend to produce ever sharpen- 
ing gradients at each synaptic level 
for lines and edges. The cortical 
mosaic is considered to be of a much 
finer grain than the retinal (see de- 
scription of pathway multiplication 
above): “ ... quantitatively the unit 
paths near central vision should now 
be conceived, not as lines, but as ex- 
panding cylinders whose ends bear 
an area ratio of 1:10,000, and a cellu- 
lar ratio of perhaps 1:100" (53). 
Visual resolution may be explained 
by differential receptor stimulation 
produced by ocular tremor ‘“‘scan- 
ning’’ which ultimately gives rise to 
peaks of activity in the finer cortical 
mosaic. Since cortical excitation 
peaks occur upon a greatly expanded 
scale, and after successive refinement 
and sharpening at other synaptic 
levels, the accuracy of spatial locali- 
zation will be more precise than that 
afforded by the coarser retinal mo- 
Reciprocal overlap “‘... while 
broadening the base of a local reac- 
tion, provides for small shifts of reac- 
tion peaks, if the system is not satu- 
rated" (53). Thus, stimuli having a 
spatial separation much smaller than 


saic. 


31s A BANK OF CongEs, XY 


\) ON THE RETINA FOR A BRIGHT Bar (C) 
tHE Limits OF PHYSIOLOGICAL NYSTAGMUS 


the center-to-center distance  be- 
tween adjacent foveal cones will be 
resolved ; the center-to-center 
distance of cones at the fovea cen- 
approximately 20” while 
vernier acuity is of the order of 2”. 


e.g., 


tralis is 


An EvALUATION OF FACTORS 
INVOLVED IN THE MARSHALL- 
TALBOT THEORY 


Since various aspects of the Mar- 
shall-Talbot theory have been uti- 
lized to account for other visual phe- 
nomena besides acuity (see 37, 44, 
62, 63), and because it is the most 
comprehensive and sophisticated acu- 
ity theory to date, an evaluation of 
the major physiological mechanisms 
upon which the theory rests will be 
attempted. 

Physiological Nystagmus and Acuity 

The  small-amplitude, — high-fre- 
quency, involuntary flutter of the 
eyes during ‘‘steady”’ fixation plays 
an important role in dynamic theories 
and particularly in the Marshall-Tal- 
bot theory. Recent evidence on the 
characteristics of this motion will be 
described, and relevant aspects of the 
theory will be evaluated in the light 
of this material. Experiments at- 
tempting to trace the influence of 
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fixation flutter upon acuity will be 
discussed. 

Adler and Fliegelman (2) describe 
physiological nystagmus as having a 
mean amplitude of 2’ 14” and a fre- 
quency of 50 to 100 per sec. It is 
these data which are used by both 
Marshall and Talbot (53) and Jones 
and Higgins (44) in their theoretical 
treatments. Ratliff and Riggs (66) 
recalculated the data presented by 
Adler and Fliegelman and arrived at 
a value of approximately 1’ for mean 
tremor amplitude. In their investiga- 
tion, Ratliff and Riggs (66) found the 
median tremor to be 17.5”, with a 
range from just perceptible move- 
ments to almost 2’ of arc. Tremors 
greater than 1’ were rare. The fre- 
quency ranged from 30 to 70 cycles 
per sec. Other measurements of phys- 
iological nystagmus by Barlow (5) 
and Ditchburn and Ginsborg (18) are 
in good agreement with these values. 
However, another recent investiga- 
Higgins and Stultz (43) 


tion by 
yielded a median tremor amplitude of 


1.0’, with a mean at 1.2’. In terms 
of Polyak’s histological description 
of the fovea centralis (see above), 
the difference between the results of 
Ratliff and Riggs (66) and Higgins 
and Stultz (44) is roughly equivalent 
to a flutter motion across one cone 
versus motion across four cones. In 
Figures 3 and 4 we have assumed a 
tremor motion across 4 
illustrative purposes, in accordance 
with the findings of Higgins and 
Stultz. This may or may not be an 
overestimate, but in any case, the 
combination of tremor, drift, and 
flick movements make it appear 
“unlikely that any point of an image 
can remain on a single receptor for 
more than a few hundredths of a 
sec."’ (66). There do not seem to be 
any differences in the amplitude or 
frequency of tremor under changes in 
test-object shape (66), visibility, or 


cones, for 
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monocular versus binocular fixation 
(43). This squares with Walls’ (79) 
suggestion that the tremor is ‘‘due 
to proprioceptive feedback from the 
muscles, going no higher than the 
cranial-nerve nuclei —themselves.”’ 
Summarizing: Investigations 
fairly good agreement on the fre- 
quency of ocular tremor, but the 
amplitude is certainly smaller than 
the 2’ 14” estimate of Adler and 
Fliegelman (2). Ratliff and Riggs 
(66), using a contact lens arrange- 
ment, obtained amplitude values less 
than one-third as large as those found 
by Higgins and Stultz (43) who 
directly photographed the excursions 
of a small scleric blood vessel. Such 
measurements are technically difficult 
and the source of the experimental 
differences may lie in the type of re- 
cording device utilized. 

The effect of an over-all smaller 
tremor amplitude upon the Marshall- 
Talbot theory is difficult to estimate. 
Barlow (5) feels that his small ampli- 
tude results are evidence against the 
theory, while Ditchburn and Gins- 
borg (18), with comparable findings, 
regard them as compatible with (al- 
though not directly supporting) the 
Marshall-Talbot theory. These lat- 
ter investigators also note that the 
tremor is highly irregular; Marshall 
and Talbot assume a statistical dis- 
tribution of the amplitudes. We have 
already explained how they also con- 
sider the tremor characteristics, as 
described by Adler and Fliegelman 
(2), to be rather precisely related to 
the neural recovery cycle at the lat- 
eral geniculate. However, Marshall 
and Talbot do not work out any ex- 
plicit, quantitative relations here, 
and in another source (52) describe 
the relationship in a way which 
seems to involve other eye move- 
ments besides tremor. On the face of 
it, the posited mode of operation of 
this mechanism seems highly im- 


show 
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probable, for a statistical occurrence 
of both horizontal and vertical com- 
ponents of tremor 
drifting movements 
mesh with the phases of the recovery 
cycle over the area thus involved. 
Before the various movements could 
up” a sharpened set of 
with the aid of 
a large saccadic flick would 


imposed upon 


can scarcely 


‘average 
gradients 
cycles, 


recovery 


move the crucial portions of the dif- 
fraction image to a new portion of the 
cone mosaic. 

But ocular tremor is also made to 
serve three other purposes: (a) The 
refinement and peaking of gradients 
brought about by physiological nys- 
tagmus interacting with reciprocal 
overlap at higher levels (independ- 
ently of evcle dynamics 
would presumably still be operative 
for tremor of 1’ amplitude. b 
Such image flutter could still average 
up a more precise “retinal mean local 
over time. (¢ 


Trecoy ery 


sign” Ocular tremor 
is also assigned the role of converting 
the spatial illuminance gradients of 


the diffraction image into temporal 


illuminance gradients on the recep- 


tors. A fluttering motion would 
provide rapid rate changes in stimu- 
lus strength, thus high 
neural outputs and, as a result, bet- 


ter acuity. 


producing 


There has been no direct experi- 
mental test of a, but 6 and « 
begun to be investigated. Riggs 
and Ratliff (68) describe experiments 
that contirm the role of 
some mean local 
binocular vision. Stereoscopic acuity 


have 


appear to 
sort of sign in 
measures reveal that a difference of 
2” of arc between the images on the 
eyes can be utilized as an accurate 
depth cue, yet 
cording to 
some 17.5” 
coordinated. 
ception — of 


median tremor ac- 
these investigators is 
and is not binocularly 

Apparently the con- 
corresponding — retinal 
points must be revised to include 
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temporal integration 
producing a mean retinal location. 
Riggs and Ratliff (68) conclude “that 
both spatial and temporal patterns of 
impulses the two must 
somehow be combined centrally.” 

O'Brien and O’Brien (59) describe 
an experiment which purports to 
eliminate tremor as a factor in visual 
acuity. However, the cogency of the 
results has justifiably ques- 
leuber and Bender (75). 
In a later paper, O’Brien (57) is of 
the opinion that final judgment on 
the tremor-acuity relationship should 
be deterred. 


processes of 


from eyes 


been 
tioned by 


That the edges are more important 
determinants of acuity than the area 
of the test-object has been revealed 
by a number of investigators (10, 21, 
47). The relation of this fact to 
tremor-scanning of gradients is ob- 
vious. 

Ratliff 65 
acuity 


tested monocular 
a test-object exposure 
in order to ascertain the 
effect of eve movements upon acuity. 
Drift movements during exposure of 
the found to be 
detrimental to acuity, and incorrect 
responses were significantly related 
to larger amplitudes of tremor. One 
might expect drift to decrease acuity 
by changing the location of the scan- 
tremor, but the 
fact that larger amounts of tremor are 
associated with’ poorer acuity does 
not contirm the “dynamic” theories. 
This result is compatible with static, 
intensity-discrimination theories in 
which acuity arises from the simul- 
ditferential 
set of adjacent receptors. 


during 


of 75 msec. 


test-obiect Ww ere 


operation ol 


ning 


taneous responses of a 
But that 
the functionally immobile eve is not 
only not the most eftticient condition 
for seeing, but is in general quite a 
poor one, has been recently demon- 
strated (69). It was not actually 
to immobilize the eve. 
Instead, an optical system was de- 


necessary 
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vised which provided for displace- 
ment of a vertical dark-line test- 
object in correspondence with any 
horizontal eye movement, so that the 
diffraction image would always lie 
upon the same retinal receptors. 
Under these ‘“‘compensated” condi- 
tions, fine test-objects disappeared 
after a few seconds and failed to re- 
appear. With a viewing time of one 
minute, maintenance of resolution 
was shown to benefit when eye move- 
ments had their normal effect, and 
was especially aided by ‘“‘exag- 
gerated”’ (the amplitude of image 
motion due to eye movements was 
doubled by altering the optical sys- 
tem) eye movements. When short 
exposure times comparable to Rat- 
liff’s (65) value of 75 msec. were used 
results similar to his were obtained. 
Here, the ‘‘compensated"’ (no image 
movement) condition yielded the 
best visual performance. With some- 
what longer exposure times, the 


normal and ‘‘exaggerated”’ conditions 
began to excel, and it is suggested 


that these conditions operate to 
“shift the acuity task from one set 
of receptors to another in rapid suc- 
cession so that not all of the receptors 
at any one time have achieved a 
stationary state ...eye movements 
are bad for acuity but good for over- 
coming the loss of vision due to 
uniform stimulation of the retinal 
receptors” (69, italics mine). 

The investigation described above 
(69) is in many respects crucial to 
the question of the role of ocular 
tremor in acuity. The finding that 
with short exposures the ‘‘compen- 
sated’ image was best for acuity is 
counter to the Marshall-Talbot 
theory. However, the rapid fading 
of fine lines under ‘‘compensation”’ 
with longer viewing time seems to 
indicate that once the retinal ‘‘on- 
response’ (32) to the presentation 
of the test-object is over it will dis- 
appear unless similar responses are 
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continuously evoked. Under normal 
viewing conditions, the fact that the 
‘“‘on”’ of short exposure, when coupled 
with the ‘‘on-responses’’ associated 
with diffraction image movement (due 
to tremor), should give rise to poorer 
acuity might be attributable to some 
“smearing” or confounding of the 
spatio-temporal relationships by the 
two simultaneous on-processes. Eye 
movements certainly produce supe- 
rior acuity performances when al- 
lowed to operate over an appreciable 
period of time. Therefore, it seems 
quite possible that high acuity is 
normally obtained and maintained by 
“on-off responses,’ which during 
prolonged viewing are continuously 
evoked by means of ocular tremor. 
Under normal viewing conditions, 
high short-exposure acuity might be 
obtained from the ‘‘on-response’’ of 
stimulus onset and fail to be main- 
tained by tremor due to “‘smearing” 
as explained above. This view of the 
processes producing acuity differs 
from that given by Riggs et al. (69) 
quoted above. We consider eye 
movements to be always an aid to 
acuity (since they provide the neces- 
sary temporal illuminance gradients) 
unless simultaneously confounded 
with other ‘“‘on-response’’ producing 
effects (e.g., flash exposure of the 
test-object). By ‘‘on-response’’ is 
meant the on-component of ‘‘on-off”’ 
optic nerve fiber activity (32). This 
will be given further consideration in 
a later section. 

It is possible to test the importance 
of ‘‘on-off responses”’ for acuity in a 
precisely controlled way by using a 
flickering test-object under ‘“‘com- 
pensated”’ conditions. With such an 
arrangement we would predict that a 
fine-line test-object would not dis- 
appear. This is just what occurs,’ 
but there are complications. Very 


7Tom N. Cornsweet. Personal communi- 


cation, Feb. 7, 1955. 





the neighborhood ot 


one CV le per sec. 


slow flicker in 
permits continued 
seeing of the test-object with no fad- 
ing out, but there is a rapid decre- 
ment in the percentage of time for 
which the test-object is 
flicker frequency is 
about 1.5 
studies on the cat 


when 
than 


Ele trophysiologi i 


seen 
greater 
cps. 
indicate that 
channel ‘on-off 


capacit) for 


sponses’ in the retinal ganglia 
and evoked cortical potentials 
than 1 cps, and 


SoMa eer 
flicker fusion 


is much higher 
the 
quency. If resolution is related to the 


proximates 
“on-off mech 
then 


lor an opti 


activation oj 
have suggested 


ditticult to 


{as We 


' 
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around 1 with 


! 


seeing condition cps 


“compensation” at marked dete 
hicher tre 


which are well within the c 


rioration with “quencies 


pare ities 


ft’ mechanisms 


Verh tps 


lateral 


od‘ot the 


Mmportant tactor 


Another piece of experimental evi- 
this 


dence seems to militate against 


view. Senders (71) found that when 


a flicker light source was used the 
test-object’) luminance” required 
threshold visual resolution to occur 
was less than would be required by 
the Talbot-Plateau law. Since ‘ce 


pensated” conditions were mn 


normal eye tremor presumably 


il flicker 
tremor-induced 
" 1] 


SHOU 


curred, ai “on-off res 


“on-ott 


beet 


plus 
sponses have 
founded iccording to 
above It is i 
nders’ by g 


of resolution 


seated 
note that S« 
occurs 
where Cornsweet, us! 
pensated” condition, finds 


ment. 


Evidence on Reciprocal Overlap 


Multiplication of Pathway 
The role of reciprocal overlap and 
multiplication of pathway in provid- 
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basis for visual 
the Marshall- 


Thev cite evidence 


the anatomical 
acuity is 
Palbot 
of overlap in the lateral geniculate 
of the cat and pathway multiplica- 
tion in 


ing 
central to 
theory. 


the monkey (see above 

With respect to the possibility of 
the the fovea 
is described (64) as fol- 


lows: 


situation at 
bv Polvak 


over] ip. 


nd even probable that 


il conditions the pure 
* cone—single midyet 
ganglion, the “private 
istic of the central 
ctions indepe ndent] 

ne system, even in the 
tivated withe 


is rarely ac it some 


itftuse neurons, such as 
this widening of the 
i l Wp l~es I 

the distinct- 
( | ind in 
mewhat the effect of the 


individual 


OCeSSes, 


separating the 


64, p. 431 


should be noted that 
claims the midget 


liowever, it 
Poly ik 
+} 


synapses 
eTIIS¢ Ives do not overlap recipro- 
ally. 

We have already distinguished be- 
tween sensitivitv and acuity, the one 
threshold 
to resolution 


referring to luminance 
phenome 

of a test-object Resolution depends 
upon, among other things, the diam- 
ig of the percipient 
the 
a small area at ihe 


‘rr ot the fovea groups of visual 


eter and 


Sparc IT 
elements Sut also involved is 


t] ial bevond 


receptors tend to converge upon the 


same bipolar, and many 


nnel onto the 


bipolars 
same @ inglion cell. 
rods especi lly converge in this 

many of them using the same 
pathway. Where this mode of 
tion occurs it effects a 


conners coarser 


" and thus reduces re- 
but 
summation 


(32). 


Whatever the situation is at 


“neural grain 
it enhances 
which 


solving power, 
1 


spatial increases 
sensitivity 
the 
retina, the great reduction in path 
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number at the optic nerve must be 
considered. [mn man, there are some 
125 million rods and 6 million cones 
in the retina. But there are at most 
only one million fibers in the optic 
nerve, and only some 80 per cent of 
these terminate in the lateral genicu- 
late. Any overlap or increase in 
“orain’’ which might in the 
retina will be undone (at least in any 


occur 


spatial sense) by the great Cegree of 
funneling at the optic nerve. 

For the primary 
the lateral geniculate nucleus ts the 


isual pathway, 


only relay center between the optic 
nerve and the cortex. It a lami- 
nated structure, and Walls (80) de- 
scribes its topological relations to the 


is 


retinae as follows: 


The problem of why 


geniculate nucleus] 


the LGN [lateral 
should ever be stratitied 
has seemed a mystery for half a century. In 
this time it has been brilliantly first 
that each LGN embodies an isomorph of one 
half of the binocular visual field and the ad- 
joining uniocular temporal crescent, with each 
peripheral or central quadrant of each of two 
hemiretinas projecting 
of the nucleus; seconc 


shown: 


to a specific 

1, that in the 
binocular carnivores and primates certain 
laminae receive only from the contralateral 
retina and the others from the ipsilateral; and 
third, that physiologically 
spots in homonymous hemiret 
sented bv cells scattered along a single “‘line”’ 
running through al! of the grav laminae like 
one of the toothpicks in a club sandwich. 
... But this knowledge helps us not one jot 
or tittle to see why the LGN should have 
lamination per not, if cell in any 
lamina communicates within the n 


segment 


tron 
STTO! 


gly 


correspondent 


is are repre- 


se no 
ucleus with 
any cell in any other lamina. No matter how 
closely the system may bring binocularly cor- 
respondent paths into approximation, if it is 
not for the purpose of enabling them to “‘fuse’’ 
in the LGN, then the approximation is sense- 
less; and, the lamination that brings it about 
continues to seem senseless. 


Thus, there appears to be no inter- 
laminary communication in the LGN, 
i1.e., no evidence for binocular inter- 
action here.’ This is given full cogni- 


® However, Harman (31) refers to unpub- 
lished work by Fox which indicates evidence 


] 


i 
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zance by Marshall and Talbot. In 
fact, using the cat, they were among 
the first to demonstrate it (73). There 
has been further by 
Marshall (51), although recently 
Bishop and Davis (11) report electro 
physiological evidence of bilateral 
interaction in the cat LGN. In any 
event, this particular aspect of the 
I.GN is not critical for the Marshall- 
Talbot theory one way or the other. 

Marshall and Talbot do utilize the 
observations of Glees (23) on the cat 
LGN. For both the cat and the rab- 
bit (24) Glees found multiplication 
of pathway and extensive reciprocal 
overlap within each cell lamina. Also 
cited by Marshall Palbot is the 
path-multiplication seen in the Ma- 
monkey LGN id 


(25), although nO 


contirmation 


1 
and 


by Glees al 
there 
evidence of reciprocal overlap. Here, 
fiber ended in a ot 
5 or 6 branches. The end boutons 
contacted the cell bodies and not the 
dendrites, and in all cases o 


caque 
Clark was 


each opth spray 


ily one 
bouton was related to one geniculate 
cell. Clark's (15) the 
probable function LGN 
structures is thought-provoking: 


comment on 


7 1 
ot these 


Clearly, 
key LGN] would pr 
ision in the reé 

f a retinal 


such an ar 


f the « 
seincansan cise 
1 presumably 


n anatomical basis for a high deg 


different optic oul pro 
vide a ree of 


sensitivity even in low intensities of 


nation 


1 above th: 
in the retina enhance 


It was note it funneling 
arrangements 
sensitivity at the expense of acuity. 
Clark extends the same function to 
the mechanism of reciprocal overlap. 
The overlap, characteristic of the 
nocturnal animal's LGN, is a struc- 


for interlaminar overlap in monk 


preparations. 


ey 


Golgi 
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t 
ty 
hi 


ture for increasing lig 
while the 
sharp-sight 


sensitivity, 
LGN of 
ed, diurnal animals main- 


t 
lineation in the 


tains opti 


| differentials for 
tr. 


nerve 
insmission to the cortex. 

For the human LGN only cell- 
count data are available, but 
theless Walls (80 


concludes 


none- 


more 

visual cortex 

shows that 

expanded rept tot 
panded representatl 
the visual 

Marshall (74 


Kkhesus monkev, 


pared with 
Talbot and 


on the 


work 
found 
of visual arc at the fovea (or 


project to 1 mm. of cortex, while just 
bevond the fovea the relationship was 
approximately 18’ per mm. of cortex. 
Clark (14) has shown that one 


vated spot on 


acti- 
the cortex (fired by one 
geniculostriate fiber) could spread its 
influence over a radius of some 5 mm. 
by means of short association fibers. 
This might effect a functional multi- 
plication and overlap in the cortex, 
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despite a paucity of terminal axonal 
arborization in geniculostriate fibers. 


1] Mechanisms 


It is perhaps surprising that Mar- 
and Talbot 
(32 


retina. By 


make scant use ol 
work on the verte- 
isolating and re- 
cording from single optic nerve fibers 
the bullfrog, Hartline was able to 
rou hin three 
* } 


ers 


types of 
numbered some 
ind responded to illumi- 

O sharp bacrease) by 
hort high-irequency burst of 
lowed by a slower steady 


scharge which 


. 
( main- 
long he stimulus con- 

\bout 3 ver cent of the 

“off” and 
stimulus 


as suddenly 


Was 


ype 
1 the 


de- 


o a less intense value. Some 
the fibers, 


designated 
bursts of 
“quency impulses only at the 


id termination 


1 \ pe. cave 
sudden 
stimulation 
when stimulus 


(or 


1T1¢ decrease) Ol 


rease OT 
nd 
alues were not changing. 

Marshall Palbot hint that the 


Vicar Shall and 
fibers may serve 


remained sile it 


to evaluate 
81), while 


rs provide the basis for 


brilliance (but see 


nections. 


! Physio- 

is would be a very 
‘ffective mechanism for generating 
‘on-off”’ responses since 


the recep- 


tors are subjected to rapidly changing 
illuminance 


bac k 


the diffraction 


values as they sweep 


and forth over the contours of 
going in and 
some 50 per 
Granit has suggested that 
this continued elicitation of 


must be 


Mage, 


out of action 


limes 


j 


SECCOILL 
‘*on-off’ 


responses 


| instrumental in 
producing the accurate discrimina- 
tion of contour (27, pp. 168-169). 
In any case, the vertebrate eye is 


well equipped to respond to changes 
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in illuminance. ‘Off’ fibers are 
apparently rare in mammals, and 
mammalian ‘on-off’ responses un- 
dergo complex transformations as the 
parameters of stimulation are altered. 
Granit states that the cone system Is 
organized for the interpretation of 
changes in the visual tield, and that 
the off-efiect in ‘‘on-otf” tibers plavs 
a major role since it has a shorter 
latency than the on-effect and can 
be quickly inhibited (27, p. 167). 
The oftf-effect may be inhibited by 
re-illumination, the inhibition 
curring with a shorter latency than 
the new “on’’-response (27, pp. 90 
92). Further experimental work by 
Granit (29) has shown that 
and off-components ot 


OC- 


the on- 
“‘on-oft’’ ele- 
ments are mutually inhibitory. Con- 
sidered with 
such functional 


along ocular tremor, 

would 
provide a high degree of sharpening 
of image borders. 

Hartline found that it was possible 
to map a receptive field on the retina 
for single fibers. Hllumination anv- 
where within a tiny 


mm. or so in diameter would produce 


antagonism 


retinal area 1 


a response in the fiber. The receptive 
field was of fixed type, being of the 
“en.” “oft.” or “‘on-oft"’ Variety, and 
the fields of different fibers were ob- 
aother. A 
recent investigation by Wutiler (45, 
46) on the retina of the cat vielded a 
rather 


served to overlap one 


more complicated arrange- 
ment. Keceptive tields were found to 
contain hree concentric 
arrangement 


“‘off,”’ or 


zones in 
which 


‘‘on-oft”’ 


allowed ‘‘on,” 


discharges to be 
obtained from any one ganglion cell. 


There exists a central area of low threshold 
as tested by a small spot of light. The dis- 
charge pattern of the central region is the op- 
posite of that found in the periphery or sur- 
round, The center may give predominantly 
“off,” the surround ‘‘on” discharges, or the 
reverse. An intermediate region gives ‘‘on-oft"’ 
discharges (46). 


The type of disc harge recorded at 





JOHN L. FALK 


the ganglion cell depended upon a 
variety of factors such as background 
illuminance, intensity, duration, size 
and location of the exploring spot- 
stimulus within the receptive field. 
But perhaps of primary importance 
for the problem of visual acuity were 
the mutual modifications operating 
between the zones within each recep- 
tive field. Zonal was 
studied by means of simultaneously 
exciting the central and marginal 
zones With small spot-stimuli. Stimu- 


interaction 


lation of an “‘on’’ center field at the 
center and at the surround (‘‘off” 
zone) showed that center and_= sur- 
round tended to suppress one an- 
other. “Ott center tields operate in 


Field centers 
but if stimula- 
tion of the surround is more intense 


an analogous fashion. 
tend to be dominant, 


they are inhibited. Thus, ‘‘on” 
center responses are modulated by 
inhibitory surround activity and 


produce a large variety of discharge 
patterns. 

There are numerous other complex 
effects, not to mention the overlap 
of receptive fields. Our purpose ts 
merely to indicate how zoning of the 
receptive field might function in 
acuity tasks. IKufliler (46) points out 
that with this antagonistic arrange- 
ment of central and surround zones 
within receptive fields, a slight shift 
of the eve can produce a great change 
He remarks 
movements 


in the discharge patiern 
that scanning 
“should be advantageous in the per- 
ception of contrast and in acuity.” 
If we consider an 


on ular 


center field 
moving from left to right in Figure 3, 
it is clear that as the ‘‘off’ surround 
moves into the region where the il- 


“on” 


luminance gradient is steep the center 
will be inhibited. As the center 
crosses the gradient there should be 
a sharp “escape” from inhibition, 
followed by a quick diminution of 
firing (suppression) owing to the re- 
instatement of inhibition as the left 
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half of the ‘‘off’’ surround transects 
the gradient. Such modes of opera- 
tion in the retina would seem to be 
well suited for the production ot 
sharp contours, while field) overlap 
would refine “‘retinal local signs.’’® 
However, certain stimulus 
factors may also bring about the for- 


static 
mation of well-defined edges. Gra- 
ham Granit (26) have 
that in the fovea there is inhibition 
of the more weaklv 
when differential 


and shown 
activated area 
stimulation is ap- 
Likewise, Hart- 
et al. (34) observed that even in 
the compound lateral eve of Limulus, 
brightly illuminated areas of the eve 
inhibit the activity of dimly lighted 
regions more than the latter inhibit 


In both 


1 
enhanced. 


plied to. two areas. 
line 


the activity of the former.” 
would be 


cases contrast 
The experimental data on conditions 
vielding best 


confirm this relationship. 


performances 
Acuity 
surround luminances 
the the 
background (13), pre- 
to suppression of the 


| is eT (20 


acuity 


deteriorates at 
than 
test object 


’ 
bin aue 


ereater luminance of 
Sula 


area to be resolved. has 


1 
1 


I 
he size of a 


shown that increasing 1 
dimmer surround enhances acuity 
still 


pl OCeSSeS 


There are other aspects to 


and the 
in which they may be instrumental 
resolution. Phat 


should increase with light intensity 


inhibitory 


Was 


in image acuity 
is somewhat paradoxical, for under 
such conditions greater spatial sum- 
should operate to increase 
blurredness. Polyak (64, p 
that increases in in- 
only 


mation 
image 
130) suggests 
not 

increased response in the cones, but 
also spread inhibitory influences to 
surrounding areas via the horizontal 
This collateral 
suppression is said to limit the areal 
effects of local stimulation and im- 


tensity give rise to an 


or amacrine cells. 


® But it is well to remember that the cat has 
no true fovea and is a nocturnal animal sac- 


rificing acuity for sensitivity. 
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prove acuity. There would probably 
be at least two factors operative in 
the production of such suppression. 
On the one hand, there is the contrast 
elicited by ditterential stimulation, 
as explained above; lateral inhibition 
makes for higher acuity. The finding 
(50) that inhomogeneous _ back- 
grounds yield greater acuity per- 
formances than homogeneous grounds 
with the same test-object might be 
due to the generalized inhibition of 
induced by in- 
background — stimuli. 
On the other hand, it has been sug- 
gested by (16, 
49) that light adaptation itself some- 
inhibits the action of collateral 


channels, 


spatial summation 


homogeneous 
some investigators 
how 
the foveal cones 
units. Willmer 
has investigated the influence 
upon the summation 
area as revealed by field sizes above 


allowing 
Lo behave as single 
(84 


of luminance 


brightnesses re- 
With 


luminance above threshold levels the 


which subjective 


main constant. increases in 


areas were found to de- 
1: 
i 


¢ ] 
Moclil 


summation 
ing that polysynap- 


crease, “‘11i 
nerve cells probably 
become less important; this may be 


tic bipolar and 


one of the factors in increasing visual 
acuity with higher illuminances. ...”’ 

Che fact that central flicker inter- 
action may occur between two light 
patches without the disruption of 
fine-line discrimination in the inter- 
(30) indicates that the retina 
is capable of minutely differentiated 
responses. In this case, there appears 
to be interaction area in 
which collateral must 
be manifest. 

Mention should be made of the re- 
cent work of Motokawa (55) on the 
“field of retinal induction” as it 
bears on the problem of acuity. Us- 
ing an electrostimulus method (de- 
scribed in 22), fields of induction sur- 
rounding retinal images in_ the 
human were mapped. Field 
strength strongest at figure 


Space 


across all 


suppression 


eve 
Was 
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borders, and for optical illusions was 
distributed in wavs which were re- 
lated to the characteristic perceptual 
phenomena. The Landolt ring and 
vernier-type acuity test-objects were 
analyzed, the vernier task inducing 
the more marked field deformation. 
This result is said to account for the 
much higher acuity measures 
tained with this figure. 
Although acuity as a 
the color illumination has been ex- 


ob- 


plored somewhat, the factor of chro- 


matic contrast as a basis for resolu- 
tion has been almost entirely neg- 
lected and is given no place in any 
Although it is not 
task, 
tion of the numbers on the pseudo- 


isochromatic plates is entirely a wave- 


theory of acuity. 


strictly an acuity discrimina- 


length discrimination, luminance con- 
trast cues being eliminated 
(78) puts the case: 

When at last someone determines vis 


acuities for colored targets « 


Ler ual 
n ce le red ore unds 


of equal brightness, the advocates of bright- 


1. Apter, F. H. Problems in the physiology 
of visual acuity. Arch. Ophthal., 1934, 
ll, 6-19. 

2. ADLER, F. H., & Firecetmax, M. I 
fluence of fixation on the visual acuity. 


al., 1934, 12, 475-483 

3. Apriax, E. D., & MattrHews, R. The 
action of light on the eye. Part II. The 
processes involved in retinal excitation 
J. Physiol., 1927, 64, 279-301 

4. ANDERSEN, E. E., & Weymoutn, F. W 
Visual and the retinal 
mosaic. 1 1923, 64, 
561-591. 

5. Bartow, FI. B. Eve me 
fixation P iM 
306, 

6. BARTLEY, S. H. Vision: a study of it 
basis. New York: Van Nostrand, 1941. 

7. Bartiry, S. H. The psychophysiology of 
vision. In S. S. Stevens (Ed.), Hand- 
book of experimental psychology. New 
York: Wiley, 1951. Pp. 921-984. 

8. BerGer, C. The dependency of visual 
acuity on illumination and its relation 


Arch. Ophti 


per ( ption 


r m } 
Amer. J. Physiol., 


vements during 
1952, 116, 290 


JOHN L. 


modes of 


\s Walls . 





FALK 


as the ‘sole’ basis of 
resolution will be given new food for thought. 


ness discrimination 


The aim of this section has been 
not primarily to argue for any specific 
retinal interaction, but 
rather to indicate certain fairly com- 
plex mechanisms which might play 
important roles in resolution. It is 
felt that the Marshall-Talbot theory, 
as it stands, takes insuthcient ac- 
count of the acuity-producing actions 
within the retina proper. One must 
keep in mind that the visual svstem 
resolve stimulus differentials 
embodied in differ- 
ential retinal responses which may be 


cannot 
unless they are 
channeled through the optic nerve in 


temporal sequences if not spatial 


distributions. In view of the evidence 
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THE PROBLEM OF INFERENCE FROM CURVES 
BASED ON GROUP DATA 


W. K. ESTES 


Indiana Uni 


Haves (4), 


have raised serious 


Papers by Sidman (8 
and Merrill (6 
questions about the validity of in- 
ferences trom curves of functional 
relationship based on averaged data. 
By means of mathematical arguments 
numerical illustrations, these 
have 


. given a mean 


and 
writers 
that * 
form of the individual 


convincingly 
the 
curves is not 
This 
demonstration strikes close to home 
for the theorist In the 
study of iuiterested 


behavioral changes in 


shown 
curve, 


uniquely spec ified’ (8, p. 268 


learning 
; 

earning, We are 
in describing 
individuals, but owing to limited con- 
trol over behavioral variability must 
frequently averages 


depend upon 


for groups of organisms to determine 
relationships 
areas we could 
business if it 

the mean 
vide the 
make statements concet 
tion for the individual” 


functional In many 

scarcely renic ! 
4 1] 

were aCtuall\ 


curve at 


information 


: JOS& 
More ac- 


regard the 


Unfortunately it is true. 
curately, it is true 1f we 
mean curve solely as a source of in- 
ductive generalizations. ‘This qualift- 
cation suggests that pe ssiblv the fault 
eck « but 


lies, not in the averag urves, 


terpretations ol 


in our customary 
them. 

It is 
theory, even learning 
theory, has made rath ady 


notewortis th iearnine 
aquantit 


prog 


ress in spite of the w idespread ac- 


ceptance of a false methodological 


assumption. Apparently interences 
from averaged curves, although not 
necessarily correct, must in fact often 


be so. This being the case, researchers 


134 


erstty 


in learning are unlikely to give up 
readily the habit of computing mean 
curves of relationship 
\ly purpose in this note is to show 


functional 


that we need not feel obliged to try. 
The group curve will remain one of 
our most useful devices both for sum- 
marizing information and for theoret- 
ical analysis provided only that it is 
handled with 
understanding. 


a modicum of tact and 


The principal point to be made is 
that the valid treatment of averaged 
curves depends upon the same prin- 
that 
have become faimilar to all of us in 


ciples ot statistical inference 
such cases as the analvsis of variance 
lest. 


a group of organisms 


and the chi-square Just as any 
mean score tor 
could have drisen from sampling 
any of an infinite variety of popula- 
also could any 


tions ol scores, so 


mean curve have arisen from 
ota infinite Variety ot popul Bi 
sof individual curves. Therefore 


no “inductive” inference from mean 
curve to individual curve is possible, 
and the uncritical use of mean curves 
even for such purposes as determin- 

the 


treatment upon rate of learning o1 


effect of an experimental 


rate of extinction is attended by con- 


siderable risk. These considerations 
set rather severe limitations upon the 
use of mean curves in the study of 
learning. Nonetheless we can antic- 
ipate that, as so regularly turns out 
to be the case in scientific research, 
our virtue in accepting these limita- 
tions will not go unrewarded. The 
same type of theoretical inquiry that 
has led to recognition of the need for 


caution in handlingaveraged datamay 
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be turned ina construc tive direction 
and lead to more effective exploita 
tion of the one defensible and im- 
portant theoretical application that 
the 
hypotheses about 


remains for the averaged curve 
testing of exact 
individual functions. 

The first step in this direction is to 
recognize that the effects of averag 
ing are not in any way capricious or 
unpredictable and need not be re 
distorti 


garded as artifacts o1 


Distortion arises only if unwart 
inferences are drawn from the 
curves. But given any specified 
sumption about the 
dividual functions, we can 
deduce the 
expected of an 
then to test these predictions agai 
obtained data. As in 


lorm ol! 
pro 4 
} Le en 
characteristi 


Ver ived 


See 
statistical in 
that o 


vield the same predictions 


rence 


{¢ 
il 


true ler 

undertaken will be, however, 

the in possible 
but 


under consideration 


Imitv ol 
the 


not 


eses, only one hi 


In testing quantitative 
against averaged data we n 
ceri ed either (a 

functional relationshy 
parameter values 

of organisms sampled. 
illustrated by the formerly 

pastime of trvi 

form ot } 

the attempts to verity Hull's hypoth- 
that h 
ponential 


esis ibit) stre 
nction 
reinforcements (5 
trated = by 
whether the slope parameter of the 
habit growth curve depends upon 
f reinforcement (11) o1 


attempts to 


amount ot 
whether the rate and asymptote ot 
maze learning are functions of stim 
ulus variability (9). 

In studies involving 
has been customary to operate on t 


Case a, it 
1 


ie 


tacit assumption that the torm of a 
mean curve will reflect faithfully the 
Since 
this assumption is now recognized to 
inted, 
expect averaged data to yield any 
direct answer to the question, “‘What 
the individual func- 
1, however, replace this 


form of the individual curves. 


be unwarr. we can no longer 


is the form of 


question with one which can be an- 
namely, ‘Is the form of the 
empirical curve in accord with 

m that the individual 


a given form, sav 


Were 1, 


re ol 
y=f(x, a, b, 7’ (In the remain- 
der ot the lise ussion 

nt by 
nt variable y to an independent 
ena ae 


a ) 
we shall repre- 
f the function relating a de- 
imeters a, 5, etc.) 
a specific mathematical 

research problem to 
iny given function f 


edictions be 


pr can 
curve 
Some pre- 


ms that mav be 


the mean for 


inisms 


Ine with this tvpe ot 
discussed below. 
nvolving Case b the as- 
made 
for the 


\ roy a 


been 
ined 
‘function describing 
a group ot or- 

‘ ea, b, AS 
the same form 
il to the means of the 
ndividual parameters. 


with 


not generally 
this case will 
the 
assumption 
investigating in- 


recognizing in- 
ich the 
secon l, 


} 


hich it does not hold in 


ances in wh 
holds, and, 
stances in W 
order to determine what information 
about parameter values is obtainable 


from the mean curve. 


CLASSIFICATION OF FUNCTIONS 


Relative to these problems, the 
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mathematical functions that we will 
have occasion to deal with can be 
classified into three types, each call- 
ing for somewhat different treatment. 
Let us consider briefly the problems 
that will arise in dealing with each of 
these types and illustrate some of the 
procedures that will prove useful in 
dealing with them. 
Class A. Functions 
averaging. In 
curve for the group has the form ot 
the individual function and the 
rameters of the mean 


WMHPLOd! fled by 


these cases the mean 


pa- 
curve are simply 
the means of the corresponding in- 
dividual parameters. Phe chiet prob- 
lem here is that of defining the class of 
functions so that we 
instances of it. 


will recognize 
he essential charac- 
teristics of the class will be apparent 
from consideration of a few examples: 


A numerical illustration involving 


les will show in ral 
| 


one of these exam] 


concrete way how the averaging 


process works out for this type ol 
function. 


have 
two organisms whose behavior in a 


Suppose that we 
learning situation is described by the 
function y=a leg x, 


2 where a is a 
which 


constant varies in value from 
one organism to another, but remains 
throughout learning 


fixed in value 


for any one organism. Let y; and 
ye be response measures for the two 
the value of a be 
1 for the first organism and 2 for the 
second. 


organisms, and let 


Then the course of learning 
for the two organisms will be de- 
scribed by the equations 

vi=log x 


D 


and 
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yo? lor x, respectively, 


Now we compute the “empirical” 
response measures for each organism 
for the first four values of the inde- 
pendent variable x as indicated in 
Table 1. Then by averaging the two 
response measures at each value of 


rABLE 1 


ErrECT OF AVERAGING A SIMPLI 
LOGARITHMIC FUNCTION 


log \ 


1.20 


the mean 
curve represented by 
the 
however, 


v, we obtain “empirical” 
the values in 
headed y. It 
that mean 
values also represents the values ot 
the function = 1.5 log x. 
function describing 
curve is of the 


column is clear, 


the column. of 


Therefore 
the 
form as the 
individual functions, and the param 
the function 


the mean 


same 
eter ol describing the 
mean curve is the mean of the individ- 
ual parameters. 


All) functions 
class work out similarly.! 


belonging 


what 
is that para- 
meter in the function appears either 


the simplest terms, they 


have in common each 
alone or as a coethcient multiplying 


a quantity which depends only on 
In aver 
the 


value of x 


the independent variable x. 
aging, any 
sort 


quantity ol 

each 
the 
mean 


factors out at 
appears in 


the 


and mean 


value of the 


Curve, 
multiplying 
parameter. 

Class B. 
ing cemplicates the interpretation of 


leaves form unchanged. 


‘J ¥ } 
Functions for which averag- 


parameters but 
Examples of functions falling in this 


class’ are 


1 See Mathematical Note 1. 








encountered in some guise or other in 
many learning theories, and given 
detailed discussion in Sidman’s pa- 
In the first example, we can re- 
write the function in the forn In some cases, a function belonging 


class can be moved into Class 


log b+ log 


or even Class A by means of an 
then it is apparent that the mean appropriate transformation. Take, 
curve for a group of organisms which rexample, the exponential function 
differ with respect to parameter ive % - If the value of the pa 
will be logarithmic in torm, for the ameter a is known tor all individuals, 
same reasons discussed 1 > pre { in be subtracted from the re- 
ceding secuion, but will have the ) | measure y, leaving us with the 


1 1 


= g 
mean value of log 6 rather than lo mpler equation 


b as the intercept constant. Thus 
from a mean empirical curve, we cat 
obtain an estimate of the geomet he te » made more trac- 
mean of the parameter 0 tot able by the logarithmic transtorma- 
organisms sampled, but no estimate 
of the arithmetic mean of 0. 

In the second exalt iple, 


rw ill be 


. ; . ‘presents the me wl, or 
estimate ; or ? tained x Ad ilue of the term in paren- 
from the average oe a eZ . {. n. 1 ike logarithms 

the ee ae ewe base pendent variable 

functions in this Lis nT y’ and ol A RE aE 
culties if we are int { ~ Sat al dies Mae Re i 

m ol le Tun neal \ ! ny individual and the averaged 

curve tor a group will be linear; from 

ae - can obtain esti- 

changes parametet alu mates of the mean value of the param 
function of experimental itments sisi e-ncsik tak aibint eae caer 
then care must be taken to allow tor he parameter i, a ors 
ee ee — rategem the problem of testing the 
Class ( hypothesis that an exponential func- 


hoary 
) avere 


tion holds tor individual organisms 


this class uns any terms has been reduced to the very simple 


involving the independen ha problem of determining whether the 
x which will 1 r out he ’ n 1 Inve plotted from the trans 
sum value vy over grouy formed ta departs significanth 
organisms for a 5 Similarly 


; 
. linearits other 


Phe most familiar Ipre ‘ ypotheses that might be tested 
tion belonging te las wainst the group data are greath 
“growth” curve simplified. Suppose, for example, 
Sn5 tint aanteest thata theoretical curve ot ENTIM tion 
3 See Mathematical took the form of this exponential 
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function, with yv being a 
measure, x 
the asymptote @ equal to zero, and 
that we were interested i: 


tion whether some difference in the 


response 
number of trials, and 


1 the ques- 


experimental treatments given two 
groups of organisms influenced rate 
of extinction; by means of the sug 
this 


transformation, 
that of testing 


vested prob 
lem would reduce to 
for a difference in slope between two 
A variety of trans- 
may be useful in 


have 


regression lines. 
formations which 
situations of this sort 
discussed by Mueller (7 
Even 
cannot be moved 


1 
been 


when functions in 


into one of t] » more 


{ 
2 
docile classes by anv available t: 


ans- 
when for 
the 
be the ca 


rmation 


formation, or some 
transformation ol 
desirable (as might 
contemplated 
heterogeneit. ol Val 


reason 


t 


data is 
transl 
duced ances 
along the curve), we are 
‘| he extent 


not neces- 


sarily helpless. to which 
functional form is moditied by aver- 
aging will generally depend upon the 
dispersion of parameter values in 
the group of organisms sampled; thus 


in some cases it may be possible by 


studying individual urves esti- 
mate the dispersi« n ot 
values in 


whether the form « 


parameter 


the group and detert 
the mean 
can be exper ted to conform « 

to the form of the individual fur 
e-o.. (3 
the 
functions, it will usually be possible 
| analy- 


the main characteristics 


tions: see, Further, even 


in the case of most reiractory 
by appropriate mathematica 
sis to derive 
that should be predicted for an aver- 
aged curve; an analysis of this sort 
for a ‘‘growth”’ function has been de- 

>? 


scribed in a recent paper (2). 
THE ROLE OF EXPERIMENTAI 
IE-RROR 
The analysis given here might be 
objected to on the grounds that we 
have considered only the effects of 
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averaging upon data obtained trom 


idealized which behave 
strictly in accordance with theoretical 


organisms 


Lunctions Kesponse measures ob- 


tained from real organisms on 
the hand, be by 


various sources of experimental error 


mav, 
other influenced 
as Well as bv the variables taken ac- 
Phe ob- 
fatal 


+ 


count of in a given theory. 
jection is pertinent, but not 
Che answer is that in testiig itheore 
ical prediction one must make 
explicit assumption about the ro 
test situa- 
test, 
will 
the degree to 
which such assumptions are satish 


experimental error in the 
tion. n ‘ in anv statistical 
the 


be « 


‘ lusions 


In some instances, it may 
sonable to assume that 
tion of experimental error is 
ibove 
Fre- 
quently it will be more reasonable to 
the 
in working with at 
that 


with tre 


then the analvses given 


will apply without modilication, 


operate assumption, rou- 

tinely m 
models, error 

bines additivel ‘atmien 

ine the obse 

sponse 

wish 

function 

individu 

observed respon 

individ 

and a 


resents 


Now if 
pendent of x, and if 
falls Class A, 


individual curves will 


variable 
the 
averaging of 
vield a mean 


the error 
function f 


in our 


curve described by the function 


If the mean value of e is zero, which 
example, be the case when- 
distribution of 


will, for 


ever the errors is 





normal, 


YW 


1, 
8 | 
1 


> latter 
stas 


ae 


then the form of the mean 
ill be u 


the me 


1On 


mati 
and or 


in the 
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pothesis under test, and then to derive 
naflected by the erro the properties that should hold for 
i the averaged curve if the hypothesis 
is correct 
« st il I ta | > d 
irve will be changed } 


ical displac ement. | 


inl is 


ot zero, then tl 


I 
will be modifies 


If the predictions thus 
erived are in accord with data, the 
ypothesis remains tenable; if they 

not, then the hypothesis can be 


Ol 


rejected at some specified level of 
confidence. Utilized within this 
iramework, tl 


be 


most valuable techniques for the 


1e averaged Curve Can 
expected to remain one of the 


ilvsis of behavioral data, and in 


to increase progressi\ ely in 
> as mathematical and statistical 
ntinue to enlarge our rep- 
special devices for the 
ff particular problems. 
MMARY 


ATHEMATICAL NOTES 
ore formal criterion 


is desirable for son 


‘ 


ay be formulated 
consider 

\t an 
mav regard 


paramete 


the value of the a 


a given organism; f,' 


he criter 


iper will hold for all functions 
in be expanded by Tavlor’s theorem, 
which includes all the 


iB 


elementary 
functions and, in fact, all explicit functions 
that the psychologist 


st is apt to have dealings 
with. 
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represeits the 7th derivative of y with 
respect to a, evaluated at a@=d; and 
soon. When the function is averaged 
over a group of mdividuals, we obtain 


Our criterion tor inclusion of a fune- 


tion in Class A mav now be stated: 


if in the Taylor's series dey clopment, 
all second and higher order partial 
derivatives of the function with re- 


spect to parameters are zero, then 


the function is unmodilied by 
aging. 
y =a log x, we have fa x ft 0: 


aVer- 


Applying the criterion to 


and therefore J=d log a 
ment with 
above by a 
2. A sufi 
sion of a function v=/(x, a, 6, 
in Class B is that it does 
the Class 
panded around d, 6, etc., but 


avree- 


the conclusion reached 
more informal route. 


lent criterion tor inclu 


lot Satisty 


Nau 
fk when 


criterion ot 


) 
1 
1 
i 


ex- 
does 
rewritten 
expanded 
it, 2D, Ste; 


satisfy that criterion whe 


Metie. tt tC --+-) and 


around 4, 


being 
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aves, AK. J. 
method for th study oft 
Psychol. Rev 
Buen, € ] ) New 
York: Appleton Century-Crofts, 1943 
MERRILL, MARGARET. ‘The relationship 
of individual growth to average growth. 
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learning. 
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functions of the parameters a, 0, 
). In the first example under 
Class B above, this criterion is satis- 
fied if we let log b=a; in the second 
example, it is satished if we let 1/a = 1 
and b/a=v. 
a i 
then 


a function falls in Class C 
in the Taylor's series develop- 
the 
derivatives 


ments described above, some ot 
higher order 


will depend on a 


Sci ond OT 
regardless of how 
thus the 
Aor Class B cannot 


%,. 2, Cic.- Aare chosen, and 
criteria for Class 
be satisfied. 

It will be noted that 


criteria 


these formal 
provide more rigorous deti- 
the 


can be given in 


than 
nonmathematical 
should be em- 


phasized that the conclusions about 


nitions ot Various classes 


lerms. Liowever, it 
inierence tron) averaged curves that 
we have reached in this paper do not 
depend on 


abstruse mathematical 


analyses. In many practical situa- 
tions, questions con erning the etlects 
handled by 
: } 
simple numerical methods of the type 
illustrated in a 


of averaging can be 


n earlier section. 
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\NALYSIS-OF-VARIANCE TESTS 


IN THE ANALYSIS 


AND COMPARISON OF CURVES'! 


DAVID A 


niversity of Wisconsin 


7 


The this 
extend methods presently ay 
the 
curves 


purpose Oo} paper is 
ailable 
and the com 
obtaine 
chological experiments 


WwW ill be 


ue quaintane eship 


analy sis 


for use in 
parison ol 
assumed to have 
with 
variance techniques at 

level, such as ¢ ould be obtaine 
the lable 


tron 


most ol better avai text- 


books. The present paper will deal 
with the inalysis of curves that result 
when the ditference between expe 

mental treatments involves a scaled 
independent variable with equal steps 
(or equal logarithm steps he LWee 
levels ot variable 


Examples would be learning curves, 


the indepe ident 


extinction curves, dark adaptation 
curves, time-to-attain a level of dark 
adaptation as a function of the lut 
nance of the preadaptation 

sponse 


rate as a lunction of 


amount of reinforcement, etc. The 
procedures apply to most curves or 


sets of curves in which y, the de- 
variable, is plotted as 


function of x, the independent vari 


pendent 


] 


all> 


able, where there are equal inter, 


between the values of x. 
that 


tion in 


It is assumed 


there is random normal \ 


the depei lent variable al 
with 
There may or may 


each data point equal errot 


Variances not be 
parameters expressing qualitative or 
additional 


1oOg- 


quantitative Variation of ( 
independent variables that are ort! 


onal tO x. 


' This paper was written during a research 


leave supported by the Graduate School Re- 
search Committee of the University of Wis- 
consin from funds provided by the Wisconsin 
Alumni Research Foundation. 


GRAN 


ANALYSIS OF SINGLE TRENDS 


Phe tirst procedure, which will be 
briefly presented, has been described 
in Fisher and Yates (6) and Pearson 

nd Hartley (8) and in other readily 
available sources. 

\s an example, consider a selected 

data from the experiment of 

int and Schiller on the generaliza- 
tion of the conditioned GSR to visual 
stimuli (7). In this experiment seven 
different groups of subjects were all 
conditioned to vive GSR’'s to a 12- 
inch visual The seven 
groups were then extinguished on 9- 
inch, 10-inch, 11-inch, 12-inch, 13- 
14-inch, and 15-inch visual 

The 


GSR on 


stimulus. 


luli, respectively. average 
the initial 
extinction trial, using the log-conduct- 
ansformation for 14 subjects 
group, is given in the broken 
igure 1. 

It was expected that 
might general tendencies. 
First, experimental steps 
were taken to prevent it, it was ex- 
pected that there would be a tend- 
ency for the stimult to pro- 
duce larget In other words, 
the tend to drift up- 
ward from left to right in Figure 1 
as indicated by the labeled 
“linear component.” Secondly, it 
was expected that the data would 
show a generalization function which 
might 


magi itude ot 
ance WU 
in each 
curve ol 
these data 
show two 


although 


longer 
GSK ’s. 
would 


means 


line 


be a symmetric, decreasing 
function about the 12-inch test stimu- 
lus as indicated by the curve marked 
The ob- 


tained data seem to bea composite ol 


“quadratic component.” 


both of these tendencies plus some 


random variation. It is obvious, 
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GENERALIZATION TEST STIMULI 


Fic. 1. First GENERALIZATION TEST 


MENT; 


the 
unre- 


under these circumstances, that 
between groups F is singularly 
vealing. The means for the generali- 
zation test data appear in the 
column of Table 1, and the analysis 
of variance is summarized in the 
lower part of Table 1. The be 
groups mean statistically 
significant as shown by the F of 2.23, 

but the conclusion permitted by this 
namely, that the group means 
differ significantly, has little 
on the phenomena with which the 
experiment deals. 

Far better tests of the 
hypotheses in which we are 
can be obtained from rather elemen- 
tary mathematical 
considerations. In 


seond 


tween 


square 1s 


test, 


bearing 


specific 
interested 
and statistical 
general if y, the 
dependent variable, is a 
x, the independent variable, 
may be expressed in 
power scries. 


function of 
then y 
the form of a 


V=dytaixt+aqx*+agx3+ +--+. [1] 
Independent statistical tests of the 


linear, quadratic, cubic, and higher 


TRIAL DaTA FR 
TRANSFORMED GSR PLOTTED AGAINST VALUE OF THI 


mM GRANT-SCHILLER EXPER! 


Pest STIMULI 


curve can be ob- 
because the 


components Of a 
tained 
mav be 


above formula 


Written in the form: 


where o, \ is the orthogonal 


ial of :th degree. 
ditions of normal, 


poly 


nom Under the con 
homoceneous, ran 
dom variations about the data points, 
levels 


A; can 


ind equal intervals between 
ot the independent variable, 
be independently estimated = and 
tested that independent tests are 
available for the 
nificant linear, 


existence of sig- 


cubic, ete 
components of the trend. The 


quadratic, 
proce 
standard (6, pp. 27-29; 8, 
pp. 91-95 

In the -Schiller experiment, 
the hypothesis that increasing size 
of stimulus is increasi 
GSK 
the sig 


dure is 
Grant 
related to 


1g 
is best investigated by testing 
linear 
ponent of the trend, and the hypoth- 
that 


gradient 


rmificance of the com- 


generalization 
symmetrically 


there is a 
decreasing 


esis 





Vf 


12-inch test stimulus is 
best investigated by 
cance of the quadratic component of 
The pro edure for making 
First of 


all, the group totals are written down 


from the 


testing the signifi- 


the trend. 
the test 1s simpli itv itselt 


hree ot Table 1. 
] 


C- 
i 


in order, as incolumn 
There are seven data points tor 6 « 
between 
itribut 
components 
es of the six 
1or e ich 


obt 


S 


of treedom group 


that 


grees 
totals, so the co ion of 


SIX orthogonal 
he valu 
nomials 

L pp 
ind | 


wore Ire 


i 
~t] 
ul 


ey 


bcl 
re 
LT ¢ 


i hese \ i] ICs 


porti 
} rul 


nomial, 


and 


QOuintic 
Sextic 


F-VARIANCE 
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written in 


CTOSS- 


been 


Sums 


coetlicients have 
column headed I. 
products are then computed for each 
of the orthogonal polynomials. These 
are obtained by summing the prod- 
times the 
The sums 


of 


ucts of the polynomial 


respective group totals. 
ol cross-produc ts are labeled P;, and 
they appear at the foot of the upper 
part ol Fable 1 Under column 
headed | is the sum of all of the group 
totals or the grand total GT. To ob- 
tain P,, the sum of cross-products for 

Ist degree polynomial, we have: 
21.34( —3) + 37.01(—2) +59.87(—1) 
+ 54.09 (0) + 54.10 (1) + 39.03 (2) 
3) = 89.83. Similarly, under 
have 21.34(3) 
+ 54.09 (6) 
+ 51.86 (3) 
Table 1 


SIX sets 


+51. 86 
OD; we 

+ 59.87 (1 
39.03 (— 7 
Also given in 
of the 


is 


lor each ot 


Dat 


SCHILLER 


31.4938 
78.0098 


58 


— to nw te 


oO 


0.0349 


4.6292 
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polynomial coethicients. Thus, for 
the first... orthogonal — polynomial 
Doe Gu? = (—3)?+(—2)?+(- 1)?+(1)? 
+(2)?4+-(3)?=28. The sum of squares 
for each orthogonal component is 
then simply: SSp,=P?/n Do, O32 
where » is the number of scores 
making up each group total. More 
generally, this formula can be writ- 
ten: 


SSp,= ( > T.@ ) [ (xx o ) [3] 


where the subscript, 7, refers to the 
order of the polynomial and the sub- 
script, k, refers to the level of the 
independent variable. 

For the component, then, 
the sum of squares will equal 
(89.83)?/(14)(28) or 20.5853; and for 
the 4th polynomial the sum of squares 
will be (125.83)?,/(14) (154) or 7.3438. 
The sums of squares are entered in 
the summary of the analysis of vari- 
ance table. A good check on the 
computations is that the sum _ of 
squares for all of the orthogonal com- 
ponents must equal the SSjperween grou 
In this particular instance, only th 


linear 


3 


e 


linear and quadratic sums of squares 
would be computed to test specific 
hypotheses, and the additional com- 
ponent sums of squares need be com- 
puted only to check the accuracy of 
computations. 

When the mean squares resulting 
from these 
against the error variance, it is found 
that only the quadratic component is 
statistically significant, although the 
linear component approaches signifi- 
cance. It is possible to 
conclude that there is a significant 
quadratic trend in the data of Fig- 
ure 1, or that there is a statistically 
significant generalization effect. The 
other orthogonal components have 
been tested in Table 1, but, for 
reasons given by Duncan (4), it is 


components are tested 


therefore 


DAVID A 





GRANT 


not a particularly wise procedure to 
test components tor which there are 
not specific a priori hypotheses. 
The above analysis can generally 
be used when a different group of 
subjects is run under each of the 
levels of the independent variable. 
When the same subjects have been 
used in repeated tests under different 
values of the independent variable, 
it is subdivide the 
SScorror into orthogonal components 


hecessalr\ to 


in order to obtain appropriate error 
the com- 
ponents of the over-all trend. The 
procedure for subdividing the inter- 
action will be outlined as a special 
the procedure de- 
scribed below tor repeated measure- 
ments. 

A word of should 
serted here with respect 


terms for testing each ot 


case of general 


caution be in- 
to logarith- 
exponential, and trigonometri 
functions. If there 
bases for expecting such transcen- 
dental components in the data, and 
the data are highly reliable, all ot 
the orthogonal polynomial compo- 
nents may be statistically significant, 


mic, 


are theoretical 


and a more definitive test may be 
obtained by fitting the appropriate 
parameters. This type of 
beyond the scope of the 


paper. 


test Is 


present 


ORTHOGONAL POLYNOMIAL ANALYSIS 
OF TRENDS BASED UPON Rr- 
PEATED MEASUREMENTS O1 

THE SAME Ss 


Alexander (1) presented a 
very useful technique for the analysis 


has 


of trends based upon repeated meas- 
the individuals 
The present paper extends the pro- 
outlined by Alexander by 


urements on same 
cedures 
further 


components ot 


analysis of the orthogonal 
the trend and by 
providing for the separation of orthog- 
onal be- 


tween groups which may be permitted 


components of differences 








VST 


OF-| 


the 


desig Il. 


by lovic of the 
Phe notation used is adapted 
from Alexander (1). 

The general functional plan of the 


follows: We 


‘otal of m subjects in p groups, and 


experimental 


have a 


analysis is as 
Cal h subject has contributed a score 
of the 
will be nk 


“ind 


on each of & trials or stages 


experiment so that there 
all. 
vidual trends.” Each group will then 
have a trial so that 
there pk group by trial 


he “group trends.’ There 


in These give the 


scores 
mean on each 
are means 
which give t 
will also be a combined group mean 
on each of the & trials which gives the 
“over-all trend” Ignor- 
ing the scaled or sequent! il character 
of the trials variable, the 
could) be analyzed 
with k-1 df, SS,, 

Si individeats . Wit ®t 
with (k—1)(p 


Jexeziote With (k 


of the scores. 


total sum of 
squares into: 
SS,. 

p—1 
ha i ee 1 
and SS;, 


with 


df: 


b) df. But trials is a scaled variable, 


and the Alexander (1) scheme. of 
trend analysis subdivides SS;, 


iale 
with 1 and 
“ro rity With k 
en down into SSbetwee: 
b— 1 df ated SSccue deviation & 
with (p—1)(k—2) df 
the differences between 
linear 


to be Fad : 
5, ta 


r-a Ss 
f: similarly 


~ dl 


. is brok- 
ero wiabes Wee 
n linearity 
these deal with 
groups in 
components and nonlinear 
ol nd, respectively ; 
ana the SSisaividuaiaw icin 
down into SSperween indi 
n—p df and SSindividual deviations fr 
with (2— p)(k—2) df which 
the 
individuals in linear components and 


components 


is 


broken 
lones Wit 


‘ s 


est ation 


consists of differences between 


nonlinear components of trend, re- 


Our method goes on to 
hk er 


ail ieviat ns irom 


spec tively. 
the 
SS, 


separate 


linearity,» grouy deviations from 


and SS 
ito the quadratic, cubic, quartic, 
etc., components of the over-all trend, 
the differences between group trends, 


earitys 


individual deviations fror estiination 


1k 
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and the differences between individual 
trends, respec tively, so that we have 
group quadratics, between 
group cubics, and between indi- 
between individual 


between 
etc. 
vidual quadrati 
cubics, etc. 

If the p groups form an orthogonal 
design, e.g., variables A and B with 
a—1 and b—1 dj}, respectively and an 
AB interaction with (a—1)(b—1) df, 
squares between 
quadratics, 
cubies, can be separated into 
an A, a B, and an AB interaction 
component. This extension of Alex- 
ander’s procedure is frequently help- 
ful, and it too has been included in 
the example 


then each sum of 


groups (means, slopes, 


ete 


below. 

As an example, selected data are 
presented in Table 2 and Figure 2 
from an unpublished experiment by 
Grant, Kubovama, and Patel on the 

fluence of electric shock stimulation 
on the cor ceptu. |] behavior of *‘anx- 
r In Table 
left-hand side appear the 
perseverative error on the 
second, third, fourth, fifth, and sixth 
stages of the Wisconsin Card Sorting 
Pest (WCST) to the 
square-root transformation (2). The 
values of the transformed 
scores have been plotted for each 
group on the stages 
Figure 2. In this experiment high 
and low anxiety groups were selected 


ious’ and ‘‘nonanxious’’ Ss. 


2, on the 


scores 
as subjected 
average 


successive in 


m the basis on the Taylor Anxiety 
he (10) and 


to three subgroups each, so that the 


ale were subdivided in- 
subgroups could receive 0, 2, and 12 
shocks during the course of 
experiment. 


electric 
the The Ss receiving 
two shocks received them at the first 
stage of the WCST and the Ss re- 
ceiving 12 shocks received two shocks 
per stage on each of the six stages of 
the WCST. ~~ High-anxiety groups 
were designated Ho, He, and Hy, 
respectively, and the low anxiety 
groups were designated [.o, Le, and 
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2 3 


4° 5 


SUCCESSIVE STAGES 


Fic. 2. TRANSFORMED PERSEVERATIVI 


ERROR SCORES PLOTTED AT EACH STAGE OF 


THE WISCONSIN CARD SORTING TEST FOR THE SIX EXPERIMENTAL GROUPS 


Lys, respectively. Data from selected 
sets of four of the 17 Ss in each origi- 
nal experimental group appear in Ta- 
ble 2. 

From Figure 2 it is apparent that, 
as expected, two of the high-anxiety 
groups showed progressive deteriora- 
tion in performance during the course 
of the WSCT. In contrast, the low- 
anxiety groups showed progressive 
improvement throughout the experi- 
ment as did also the H: group. Over- 
all tests of the differences between 
groups and differences between stages 
are not particularly revealing with 
respect to the trends obtained. What 
is needed is a test which will show 


whether or not the improvement in 
the low-anxiety groups as contrasted 
with the failure of the high-anxiety 
groups to statistically 
significant. If the low-anxiety groups 


improve is 


improve and the high-anxiety groups 


this should result in 
statistically significant differences in 
the linear components of the trends. 

The computational procedure for 
the orthogonal polynomial trend 
analysis is essentially simple and 
straightforward, but there are tre- 
quent opportunities for error. A 
number of excellent checking pro- 
cedures are available, but as with all 
calculations, extreme care must be 


deteriorate, 
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TABLE 2 


SELECTED TRANSFORMED PERSEVERATIVE ERRORS AND SUMS OF 
Cross-PRopUCTS FOR TREND ANALYSIS 


exercised to insure accuracy. Sub- 
scripts will be used as follows: 7 refers 
to groups, 7 refers to the order of the 
polynomial, & refers to the trial num- 
ber and / refers to individuals. The 
total number of subjects is ; the 


194 53 8.3 ‘1 


number of subjects in the 7th group 
is m;; there are A trials in all, and the 
total number of groups is p. 

The first step is to enter all of the 


into a 
In the left- 


original scores, by groups, 


table such as Table Z 
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hand part of the table the scores of 
each subject are entered in a separate 
row, and space is kept for group 
totals. The marginal totals are then 
computed, the row marginals in the 
column headed I, and the column 
marginals for each group headed 
the rows labeled T,,., Tox, ete, Thus, 
for the first row 1=1.7+1+1.7+1 
+1=6.4. In the first column of the 
first group the Ty. = Tn=1.7+342.4 
+1.4=8.5. kor each group, the sum 
of the I’s and the sum ot the 
must be equal, which serves as a 
check on the computation of the 
marginal totals. At the foot of the 
table, the totals for each stage, T 
$4.1, 42.8, , 28.3, are computed 
and these add up to the grand total, 
GT, of 194.9 which must equal the 
sum of the group subtotals; i.e., 
27.7+30.8 + 31.7+44.6 + 18.8 + 41.3 
= 194.9, 

The next step is to compute and 
check the P’s. First the table of the 
orthogonal polynomials, @, for the 


. 


number of trials or stages is found in 
Fisher and Yates (6) or Pearson and 
Hartley (8), and these are given in 
Table 3 for the present example. To 
obtain the P's for each subject, the 


following formula is used: P, 
=> veh. For example, for the 
first subject P,=1.7(—2)+1(-1) 
+1.7(0) +1(1) +1(2) = —1.4, 

=, gi te ie (— 

+ 1(2) = 0.0, P; = 1.7 (- 
Or ss sa, )=—0.7, and 
P,=1.7(1) +1(—4) +1.7 (6) +1(-—4) 
+1(1)=4.9. The best procedure is 
to compute all the P;’s and then all 
the P.’s, etc. As a check for each 
group, the sum of the individual P,’s 
for that group should be equal to 
the sum of the products of the stage 
totals, Ti., for the group, times the 
values of the orthogonal polynomials 
for each stage , i for the jth poly- 
nomial, >>, 1 1 Py. For 
example, the P's s for the first group, 


Di. = 
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TABLE 3 
VALUES OF THE ORTHOGONAL POLYNOMIALS 
FOR ANALYSIS OF THE PERSEVERA- 
TIVE ERROR TRENDS 


-1.4 —4.7 —3.8 and —2 

to —12.7. Also 8.5(—2 

+ 6.7(0) +5.4(1) +2.0(2) 
which serves as a check on the ac- 
curacy of the P;'s for the first group. 
A similar check must be made with 
each polynomial in each group and 
with the over-all stage totals at the 
foot of Table 2. 

The next obtain the 
quantities specified by the equations 
in Table 4. Table 4 consists of three 
columns, U, V, and W > which deal 
with the trials by individuals meas- 
ures, trials by groups measures, 
over-all trials measures, respectively. 
Associated with each entry in Table 
4 is the corresponding df. The com- 
putations indicated in Table 4 have 
been carried out in Table 
they can be identified from the 
responding entries in Table 2. Ii the 
work has been done correctly, the first 
entry in each column is the sum ot 
the remaining entries, and 
an important check. 

The only step remaining before.the 
final computations of the sums of 
squares for the summary of the analy- 
sis of variance is to obtain subtotals 
of the groups on I, Pi, Pe, Ps, and P,, 
for the three levels of shock, 0, 2 and 
12, and the two levels of anxiety, L 
and H. This has been done in Table 
6 where the entries are simply sub- 
totals from the right-hand sides of 


Table 2. 


step is to 


’ 
ana 


5 so that 


COr- 


this is 
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TABLE 4 


COMPUTATION OF THE SUMS OI 
U 


Trials by Individuals 


The final step remains of entering 
the sums of squares into the analysis 
of variance summary 
been done in Table 7. 


has 
All the sums of 
squares in Table 7, except those hav- 
ing to do with the subdivision of com- 
ponents of the sums of squares re- 
lated to anxiety, shock, and_ their 
interation, can be computed directly 


table as 


SQUARES OF THE ORTHOGONAL COMPONENTS OF TREND 


V 


Trials by Groups 


from the sums of squares entries in 
lable 5 as indicated in the column 
of Table 7 headed *‘Computation.” 
The number of df for each row of 
Table 7 is obtained by applying the 
“Computation” formulae to the df 
entries in Table 5. Then the sub- 
division of the components of varia- 
tion between group means and be- 
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PABLE 5 


COMPUTED VALUES OF THE TABLE 4 COMPONENTS OF THE TREND 


U: Trials X Individuals 


Y | Uy=h77414--- 
+ 4.92 +-4 72 =482 5300 
120 df 


Ui= 


1 
— (6.44+7.72+ - 
A 


+-17.72) = 376.0140 
| 24 df 


“a (—4.7)2 
i P, 4 


10 

ss +(4,2)8 
= 47.3850 

24 df 


[(—1.4)?4 


{(0.0)2+(3.3)2+ --- | 


+ (6.8)2]= 19.9521 


1 
=— (4.9)? + ( —0.8)? 
( 


hss « whe —4,3)%] 


= 15.2488 


ANALYSIS OF PERSEVERATIVE ERRORS 


V: TrialsX Groups 


! 
8.52+5.12+4 -- Wy= 
4 4 


+-9.72-++-9.9°) = 390.0825 + 28.32 


(54.124-42.82+4 -. 


=332 9179 
30 df 5 df 


(194 Qg 2 . 
= 316.5501 
120 


- 


1 
(27.72-+ 30.8"+ 
v 


Wi= 


((—12.7)24 


40 


(6.4 ) 
= 39.6232 


0 df 


(18.3)? 
Wp,= = 0.9967 
‘ 6 


= 3.3780 


6 df 


1 
=— [|(8.7)2+-( 


280 


he ses ( 


11.7)? 


. 97 2) 


= 5.1027 


6 df 


24 df 


tween group trends can be obtained 
from the entries in Table 6 by the 
usual analysis of variance methods; 
ie, SSg=( >OT,2/Kn,)—C, where 
the subscript, g, represents the sub- 
total for anxiety of shock level. 


Thus, for example, the sum of squares 
between group means for anxiety is 
equal to 1/60(90.2?+104.72) = Wy, or 
1.7521. The sum of squares between 
group means for shock is equal to: 


.1/40 (72,3? + 49.6? + 73.0?) — W, 
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TABLE 6 


SUBTOTALS OF SuMS AND CROsSS-PRODUCTS FOR 
COMPUTATION OF “SHOCK” AND “ANXIETY” 
ScMs OF SQUARES 


Shock 


Meas- __ 


ure 4 


< 


~ 

O pw 
| 

_ <1 

° 


| 
| 
| ed 
to W hb & Nt 


~~ 
mm Ge mF sd Ge 
mow ov 


Anxiety 
Measure 


Tu = 
Pin= 
P, 21 Pru FB 
P; : Psn=— 10.0 


P, iL= he Pyou= 32.8 


=8.8611, and the interaction be- 
tween group means can be obtained 
by subtracting these two sums of 
squares from 21.9054, line B. in Table 
7, which gives 11.2922. Similarly, 
the sum of squares for the linear 
component of the difference in trends 
due to anxiety is: 1/120(P)..°+Pi.47) 
—Wp,. The 120 is 12, the number 
of subjects per anxiety level, times 
10, the sum of the squared poly- 
nomial coefficients. The sum of 
squares for the linear component 
due to shock is 1,80(P,.7+P; 2 
+Pi.127)—Wp,. where the 80 is 8, 
the number of subjects per shock 
level, times 10, the sum of the squared 
polynomial coefficients. The sum of 
squares for the linear component of 
the interaction is the sum of squares 
of the linear component between 
group trends, line C.1 in Table 7, 
minus the sum of squares for the 
linear component due to anxiety and 
the sum of squares for the linear 
component due to shock. The actual 
numerical values for anxiety will be 
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1/120{( — 60.7? + (2.0)?] — 14.3570 
= 16.3804. The sum of squares for 
shock is 1/80[(—4.4)?+(—37.2)? 
+(—17.1)*]—14.3570=6.8381. The 
sum of squares for the linear com- 
ponent of interaction will be 25.2662 
— 16.3804 — 6.8381 = 2.0477. Simi- 
lar quantities are computed for the 
quadratic, cubic, and quartic com- 
ponents of the trend as indicated in 
the Computation column (of sums 
of squares) in Table 7. The general 
computing formula is: 


oP.) 
, 1 ( P ) 
iti igs 
eG [4] 


— 
o 


where the subscript, s, refers to a 
subtotal on P,, for a single level of 
shock or anniety. 

The mean squares of Table 7 are 
computed in the usual way by divid- 
ing the sums of squares by the ap- 
propriate degrees of freedom. It 
remains to determine the appropriate 
error variances for the F tests. In 
general, these are the between-indi- 
viduals mean squares, and the row 
entry for each error term has been 
listed in the next to last column of 
Table 7. The between-group-means 
mean squares should be tested against 
the between-individual-means_ error 
term. The between-group-trends- 
linear terms and the over-all-trends- 
linear term should be tested against 
the between-individuals-linear mean 
square; and, in general, the quadratic 
terms should be tested against the 
between-individuals-quadratic mean 
square, the cubic terms should be 
tested against the between-individ- 
ual-cubic mean square, and the quar- 
tic terms should be tested against the 
between-individuals-quartic |= mean 
square. Appropriate tests for over- 
all trend, line A of Table 7, and be- 
tween-group trends, line C of Table 
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1NALYSIS-OF-VARIANCE TESTS 


7, are a little difficult to justify, but 
probably the best error term for these 
would be the between-individual- 
trends mean square, line E of Table 7. 
These F tests have been made and 
are entered in’ the last 
Table 7. 

The significant F's of Table 7 may 
be briefly interpreted. The significant 
over-all trend means simply that the 
over-all average Vary trom 
to stage during the test. Since 
only the linear component of the 
over-all trend is statistically signifti- 
cant, it may the 
over-all trend is essentially linear. 
Higher order components of the over- 


column of 


scores 


stage 


be concluded that 


all trend fail to attain significance 
There are no significant differences 
between group means due to anxiety, 
shock, or the interaction of 
and shock. The general between- 
group-trends F is not significant. The 
linear components of the differences 


anxiety 


in group trends are, however, highly 
significant, particular, the 
anxious groups tend to have a less 


and in 


negative slope than the nonanxious 


groups. There are also significant 
the different 


shock groups in the linear component. 


differences between 
None of the higher order components 
of the 
trends was significant. 


differences between group 
(This 
anticipated, but the tests were made 
to illustrate procedures. 
significant individual 


performance as shown by 


Was 


There are 
differences in 
average 
the significant F between individual 
means, but this will usually be found 
in the case of reliable measures of 
performance. 

The procedures outlined above can 
readily be applied to longer series of 
trials and can be extended to more 
complicated 
with 


experimental 
higher-order interactions, al- 
though in many instances interpre- 
tations will become very obscure. 
The number of subjects may vary 


designs 
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from group to group, but if an orthog- 
onal design is used the numbers of 
subjects in rows and columns must 
be proportional (9, pp. 281-284). 
One other case which should be 
described brietly is that in which 
there is a single curve made up of 
average values from repeated meas- 
urement on a number of individuals. 
This would be the case if the Grant- 
Schiller data reported earlier had 
been obtained on a single group of 
subjects tested at all the different 
generalization test stimulus values. 
(Then each subject would have con- 
tributed a score for each generaliza- 
test stimulus.) In the case of 
repeated measurements the over-all 
trend can be analyzed as described 
Table 7. There will be no 
group differences but between 
and indi- 
vidual can be ob- 
tained and separated into the orthog- 


tion 


above in 


hetween 


individual between 


means 
trends measures 
In this case the 
Table 4 will not be com- 
the between-individual- 
means sum of squares will be U; — Wy, 


onal components. 
\V column of 
puted, and 
and the corresponding linear, quad- 
ratic, terms will consist 
of the quantities from the W column 
subtracted from 


( ubre . 28., 


the corresponding 
quantities in the U column instead of 
the V's from the U's as in the present 


Table 7. 


The error terms will cor- 


respond to those used in the analysis 
in Table 7. 


The above pro edures do not con- 
stitute a universally applicable rou- 
tine method of analyzing and com- 
paring trends. No such method can 
exist, and no routine can substitute 
for an experimenter’s insight and in- 
genuity. We have found the pro- 
cedures extremely useful, however, in 
comparing curves with respect to 
slopes, curvatures, sharpness of in- 
flections, etc., so that we can recom- 
mend them highly for testing specific 
hypotheses relating to trends. 
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METHODS AND TERMINOLOGY IN STUDIES OF 
TIME ESTIMATION! 


DALBIR BINDRA ann HELENE WAKSBERG 


McGill 


In experiments on time estimation, 

S to make judgments 

about temporal durations. The dura- 
S is asked to judge 


is required 


tion that Ive is the 
standard ; the estimate of the stand 
ard made by S is called the judgment. 
he studies in the area of time esti- 
mation are concerned primarily with 
the relative magnitudes of, and the 
general relations between, the stand- 
ard and the corresponding judgment. 
Despite this relatively simple design 
ol 
quite confusing to read, 


time estimation studies, they 


and at- 


tempt to make generalizations from 


an 


the results of different studies gets 


one involved in many apparent or 


real contradictions. These troubles 
seem to arise from the fact that dif- 
ferent investigators not only use dif- 
ferent methods in their experiments, 
but 


terminologies 


also employ different 


to desi ribe 
the 


Ix- 
1dYy 


sults. Thus, one investigator 


} 


uses 
method of reproduction in the st 
f a problem while another employs 
the method of verbal estimation for 
the same problem. Similarly, 


resea»rt her describe 


one 
to his 


OvVeT- 


( hc 1OSCS 


results in terms of or under- 
estimation of time, while another pre- 
fers to talk in terms of relative speed 
This 
makes it dithcult to « ompare studies, 
unless the exact 


the different methods, and 


of internal and external clocks. 


relations between 
the dif- 


ferent expressions used for describing 


1 This study was supported by a research 
grant (A.P.12) from the National Research 
Council of Canada. It is based in part on an 
M.A. thesis, ‘‘Serial-position gradient in time 
estimation,"’ submitted to McGill University, 
1955. 


T 


n 


iversity 

clarified. Some 
investigators, for example Eson and 
Kafka (2) and Clausen (1), have 
poinied out the existence of this diffi- 
culty, but have dealt with it only in 
a limited way, within the framework 
of their particular problems. Even 
the standard textbooks of experimen- 
tal psychology (for example, 5) have 
not systematically examined — the 
problem of the equivalence or lack of 
equivalence of the different methods 
and terms. This is the task of the 
present 


results, have been 


note. 


\IETHODS 


[hree main methods are commonly 
used in time estimation experiments. 
In the method of verbal estimation, E 
delimits a given interval operatively 
demonstrates the duration of 
, and S is asked to esti- 
mate verbally its duration (the judg- 
ment) in terms of seconds or minutes. 
In the method of production, S 
instructed to delimit operatively an 
interval (the judgment) of a given 
duration (the standard) stated ver- 
bally by £. In the method of repro- 
ion, J: operatively delimits an 
interval (the standard) and then asks 
S to reproduce operatively an inter- 
val (the judgment) of the same dura- 
tion. (A variation of the method of 
reproduction is the method of com- 
parison. In this method FE presents 
two intervals consecutively and S is 
asked to judge their relative duration 
by saying which one is the longer. It 
resembles the method of reproduc- 
tion in that both involve an operative 
presentation of the standard and a 
judgment that also refers to an opera- 
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(1.e., 


the standard 


is 


7 , 
auecl 
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tively delimited duration.) Thus we 
see that both the standard and the 
judgment may be defined in either of 
two different ways, by operative 
demonstration or by verbal state- 
ment. The term elapsed time, used by 
some authors, is used svnonymously 
either with the standard or with the 


judgment, and does not require sepa-. 


rate discussion here. Table 1 shows 
how the three methods differ with 
respect to the defining operations of 
the terms “standard,” “ 
and “elapsed time.”’ 
The three methods also differ with 
respect to another variable. These 
differences are also shown in Table 1. 
This variable is objective vs. subjective 
definition of the terms ‘‘standard,” 
and ‘judgment.’ (Elapsed time is 
always defined in terms of objective 
time.) The standard is sometimes de- 
fined in terms of objective or clock 
time; at other times it is defined with 
reference to subjective or personal 
time. The judgment may also be de- 
fined in either one of these two ways. 
Thus, logically, we can combine 
standard and judgment in four dif- 
ferent wavs: (a) standard defined in 
terms of objective time and judg- 
ment defined in terms of subjective 


judgment,”’ 


DALBIR BINDRA AND HELENE WAKSBERG 


time; (6) standard defined in terms 
of subjective time and judgment de- 
fined in terms of objective time; (c) 
standard defined in terms of objective 
time and judgment defined in terms 
of objective time; and (d) standard 
defined in terms of subjective time 
and judgment defined in terms ot 
subjective time. The last of 
combinations is empirically meaning- 
less. We have only three meaningful 


these 


combinations and, correspondingly, 
three main methods of time estima- 
tion (Table 1). Thus we see that the 
different methods used in time esti- 
mation experiments represent the 
logical combinations of the two ways 
(in terms of objec tive vs. subjec live 
time 
ment. 


of defining standard and judg- 


Description of Results 


The results of time = estimation 
experiments have been described in 
a number of different ways. ‘“Over- 
or underestimation of the standard” 
(5), ‘“‘over- or underestimation § of 
elapsed time” (2, 3), “the relative 
speed of the ‘internal’ and ‘external’ 
clocks,” and ‘‘the relative magnitude 
of the subjective and objec tive tem- 


poral units” (4) are alternative ex- 


TABLE 1 


SIGNIFICANCE OF THI 


Terms 
Verbal Estimation 


Standard 


Interval delimited opera- 
tively by E; refers to ob 
jective (clock 


bv E; 
time. time. 
Verbal estimate made by 
S; reters to 
time. 


Judgment 
time. 
Refers to the objective 


duration of the standard 
(operatively detined). 


Elapsed 
Time 


TERMS “STANDARD,” 
IN VaRIOUS METHODS OF TIME 


Produc tion 


Interval 


‘ters to subiec tive 


Operative estimate made 


subjective by S; refers to ob 


to the objective 
duration of S’s judgment 
(operatively defined). 


Refers 


““TUDGMENT,” AND “ELAPSED TIME” 


ESTIMATION 


Methods 


Reproduction 


stated verbally Interval delimited opera- 
tively by £; refers to ob- 


jective time, 


Operative estimate made 
ective by S; refers to objective 


time 


Refers to the objective 
duration of the standard, 
as well as of S’s judgment 
(operatively defined). 
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pressions that have been used by dif- 
ferent investigators. Thus, a judg- 
ment larger than the standard ob- 
tained with the method of verbal 
estimation may be described as de- 
noting overestimation of the stand- 
ard, or overestimation of elapsed 
time, while a similar result obtained 
with the method of production may 
be described as denoting overestima- 
tion of the standard, or underestima- 
tion of elapsed time. Some investi- 
gators present their results only in 
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sions have not been adequately clari- 
fied. Table 2 attempts to show for the 
different methods the significance of 
these various expressions with respect 
to the relative magnitudes of the 
judgment and the standard. 

Elapsed time refers to temporal 
durations as measured by standard 
clocks (objective or  ‘‘external’’ 
clocks). ‘“‘Internal clock”’ refers to a 
hypothethical mechanism in S, which 
is immediately and directly related 
to performance in a time estimation 


terms ot subjective and 
clocks or temporal units. 


relations between all these 


object ive 
The exact 
expres- 


TABLE 2 


task. The rate of this clock may vary 
from individual to individual and in 
the same individual from time to 


SIGNIFICANCE OF THE RELATIONS BETWEEN THE JUDGMENT AND THE STANDARD IN VARIOUS 


Meruops OF TIME ESTIMATION, 


PHeE Four ENTRIES IN EacH Box ARE EQUIVALENT 


STATEMENTS OF THE RELATION BETWEEN THE JUDGMENT AND THE STANDARD 


Relative 

magnitude 
of judgment --—_—-— 
and standard Verbal Estimation 
Overestimation of the 
standard 


Judgments 1. 

larger than 

the stand- 

ird Overestimation of 
elapsed time 


Internal clock faster 
than external clock 


. Subjective temporal 
units smaller than ob- 
jective temporal units 


Underestimation of 
the standard 


Judgments 
smaller than 
the stand 

ard Underestimation of 


elapsed time 


Internal clock 
than external clock 


. Subjective temporal 


units larger than ob- 


jective temporal units 


slower 


Methods 


Production 


1. Overestimation of the 


standard 


Underestimation of 
elapsed time 


Internal clo k slower 
than external clock 


. Subjective temporal 
units larger than ob- 
jective temporal units 


Underestimation of 
the standard 


2. Overestimation of 
elapsed time 


Internal clock faster 
than external clock 


. Subjective temporal 
units smaller than ob- 
jective temporal units 





Reproduction 


1. Overestimation of the 
standard 


2. Not applicable 


3. Internal clock slower 
than external clock 
during reproduction 


. Subjective temporal 
units larger than ob- 
jective temporal units 
during reproduction 


Underestimation of 
the standard 


Not applicable 


3. Internal clock faster 
than external clock 
during reproduction 


. Subjective temporal 
units smaller than ob- 
jective temporal units 

during reproduction 
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time. Subjective temporal units 
refer to subjective norms of the mag- 
nitude of the duration of seconds and 
minutes, and are presumably ac- 
quired through learning. These sub- 
jective units, like the internal clock, 
may be presumed to vary from indi- 
vidual to individual and in the same 
individual from occasion to occasion 
depending on internal and external 
conditions. It is clear that the con- 
cepts of subjective temporal unit 
and of internal clock are closely re- 
lated.* 

Three examples, one with each 
method, will help to clarify the rela- 
tions shown in Table 2. 
situation in which #, the 
method of estimation, presents a 
standard of 15 sec., and obtains from 
Sa judgment of 20 sec. The judg- 
ment being larger than the standard, 
we say that S has overestimated the 
standard. Since in the method of 
estimation, the standard and elapsed 
time refer to the same thing (Table 
1), that is, to the objective duration 
of the interval delimited by /, we can 
also say that S has overestimated 
elapsed time. The fact that S 
thought more seconds had elapsed 
than actually had, in the given dura- 
tion of the standard, means that his 
subjective temporal units were 
smaller than objective temporal units. 
This is equivalent to saying that S's 
internal clock ran faster than the ob- 
jective clock. 

Next, the situation in 
which /, using the method of pro- 
duction, obtains from S the same 
judgment of 20 when the 
verbally stated standard is again 15 
sec. Since the judgment is larger than 
the standard, we can say that S has 
overestimated the standard. Since, in 


Consider the 
using 


consider 


sec., 


? The authors are grateful to Dr. Peter 
Milner for clarification of some of these 
relations. 


AND 


HELENE WAKSBER( 


the method of production, elapsed 
time refers to the duration of the 
interval delimited by S (Table 1), we 
can also say that he has underesti- 
mated elapsed time. The fact that S 
thought he had produced 15. sec. 
while he actually produced 20 sec. 
(since 15 of S’s subjective seconds 
elapsed in the course of 20. objective 
seconds) meats that S's sub- 
jective temporal units were larger 
than objective temporal units. This 
is equivalent to saying that his inter- 
nal clock ran at a slower rate than 
the objective clock. 

Finally, consider the situation in 
which £, using the method of repro- 
duction, delimits an interval of 15 sec. 


also 


and requires S to reproduce an inter- 
val of the same duration. If S's re- 
production is 20 sec., his judgment is 
larger than the standard; he has over- 
estimated the standard. But since, 
in the method of reproduction, 
elapsed time refers both to the dura- 
tion delimited by / and the duration 
delimited by S (Table 1), we cannot 
make any meaningful statement re- 
garding over- or underestimation of 
elapsed time. Further, in the method 
of reproduction, the relation between 
the magnitude of the judgment and 
the subjective temporal units or the 
rate of the internal clock is not as 
simple as in the other methods. [i 
the other methods, over- or under- 
estimation necessarily implies that 
the S's temporal units 
(and the rate of internal clock) is dif- 
ferent from objective temporal units 
(and the rate of external clocks). But 
in the case of the method of repre xduc- 
tion, whether S's internal clock runs 
faster or slower than the objective 
clock, he may still reproduce the 
duration of the standard quite ac- 
curately, for his subjective temporal 
units are not likely to change from 
the time that he is exposed to the 
standard to the time that he repro- 


subjective 
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duces it. However, if, as in our 
example, S does give a judgment 
larger than the standard, it can be 
that S's subjective temporal 
units were not only larger than ob- 
jective units at the time of the repro- 
duction (judgment), but were 
larger than his subjective units at the 
time of the presentation of the stand- 
ard. This is equivalent 
that, during reproduction, his inter- 
nal relative, not 
only to the external clock, but  rel- 
ative also to his own internal clock 


said 


also 


to saying 


clock ran slower 


at the time of the presentation of 
the standard. In the 
reproduction, then, the relation be- 
tween the magnitudes of judgment 
and standard readily 
and simply described in terms of sub- 
jective temporal 
clock as in 
methods. 
Thus we see that « 
between standard and judgment may 


method of 


cannot be as 


internal 
the 


units or 
the case of other 


given difference 
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signify quite different underlying 
events in the different methods. An 
identical difference may signify faster 
internal clock in the method of verbal 
estimation, slower internal clock in 
the method of production, and slow- 
ing down of the internal clock in the 
method of reproduction. Stating 
results in terms of the relative magni- 
tudes of standard and judgment 
(over- or underestimation of the 
standard) is satisfactory only when 
all data are obtained with the same 
method. When, in the course of re- 
Viewing studies and making generali- 
zations, it becomes necessary to com- 
pare results obtained with different 
methods and to theorize about the 
exact mechanisms underlying time 
estimation, it would seem desirable to 
the terms of the 
relative speeds of internal and exter- 
nal clocks or relative magnitudes of 
subjective temporal 


restate results in 


and objective 


units. 
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RELATIVE TASK DIFFICULTY AND TRANSFER OF TRAINING 
IN SKILLED PERFORMANCE 


R. H. DAY 
University of Sydney 


A number of recent investigations 
have drawn attention to the part 
played by the relative difficulty of 
two or more tasks in the degree of 
transfer from one task to another. 
Results from these experiments have 
important implications for practical 
issues in training as well as for trans- 
fer theory. Since, however, there are 
certain inconsistencies in defining 
difficulty as well as discrepancies 
among the experimental results this 
area of investigation is in need of 
critical evaluation. The main pur- 
pose of this review, therefore, is to 
bring within the scope of one paper 
evidence from numerous experiments 
which have been directly or indirectly 
concerned with the effect of task dif- 
ficulty on transfer in skilled perform- 
ance and to subject this evidence to 
critical analysis. It should be men- 
tioned in this connection that a num- 
ber of the experiments dealt with here 
have not been primarily concerned 
with task difficulty in relation to 
transfer, but the evidence has arisen 
more or less incidentally during the 
course of the experiment. Also, this 
paper will be limited to a considera- 
tion of recent investigations within 
the field of human skill, although it is 
known that some earlier work (6) has 
drawn attention to the relationship 
existing between task difficulty and 
transfer of training. 

Essentially, a skilled task in its 
simplest form possesses three basic 
features. These are: (a) a stimulus 
complex sometimes called the dis- 
play, (b) devices by means of which 
elements in the display are brought 
under control by the responses of the 
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between 
This broad analysis ap- 
plies whether the task is a simple one 
such as writing with a pen upon 
paper, or a very much more complex 
task such as is involved in various 
kinds of tracking behavior. The dif- 


operator, and (c) a linkage 
these two. 


ficulty of a task may vary as a func- 


tion of changes in one or more of 
these features. Th manner in which 
task difficulty has been varied within 
each of these task features will be 
treated first in this paper. This will 
be followed by a review of results ob- 
tained after which certain methodo- 
logical and theoretical issues will be 
taken up. 

For the most part the general pro- 
cedure used in experiments of this 
nature involves either training a num- 
ber of matched groups under different 
conditions of task difficulty followed 
by performance under a different con- 
dition, or the use of the AB, BA 
paradigm or an expansion of it in 
which each S undergoes each experi- 
mental condition. Initial task diffi- 
culty is usually assessed from the 
mean score for a number of trials, the 
score achieved on the final trial, or 
the mean number of trials required 
to reach a criterion. The 
method of estimating the extent of 


certain 
transfer also varies between studies 
Metuops USED To VARY 
TASK DIFFICULTY 
Stimulus Variations 


Barch (3) using a following track- 
ing task (Modified Two-Hand Co- 
ordination Test) varied difficulty by 








RELATIVE TASK DIFFICULTY 


Using the same 
task Morin (17) varied difficulty in 
the same manner. A Rudder Control 
Test (Model C\M120C) was en ployed 
by Gagné and Bilodeau (9) Varia- 
tions in task difficulty were intro- 
duced by changing the width of the 
on-target scoriig area. In an experi- 
ment by Szasfran and Welford (23) in 
throw 
loops of chain into a box the display 


changing target size. 


which Ss were required to 


Was varied in three ways in order to 
obtain three levels of difficulty. Un- 
der the 


first conditions Ss threw di- 


rectly into the box, under the second 
they threw over a bar placed between 
S and target | the third 


and under 
se ' ; 
condition a mirror was placed behind 


the box and a screen before it. | 
the last condition it was ne 
the mi 


use rror in aiming 


“een b Lac lirect 
screen IOCNKET CIF! 


this experiment varia 
SE Ieee | ‘ | 
led necessarily to ch 


of throwing (responding 


to whether S threw directly or ovet 


the bar. Alterat 
in a following tracking ta 


IONS In target 


Variations in task dithcultv in an ex- 
qa Smith (14). 


( hanges 1n target speed necessitated 


periment by Lincoln ai 


] 
speed ot re- 


SI ee | \ 
experiment Dy An- 


hinge Changes 1n 


COTTeSpol 
] In all 
1) task difficulty 


Was 


varied by altering umber of 
moving elements in t lisplay of a 
task. 

cliched by employing a 
(SAM Pwo-l! 


a compe 


trac king accon)- 
following 
and Pursuit Test) and 
(SAM Two-Hand 

task. Under the 
| plav- 


Iso introduced. 


WSAlory 
Coordination Tes 
latter condition changes in dis 
control linkage were a 
A motor-discrimination task in which 
difficulty was varied by changing the 
nature of discrimination training dur- 
used by 


ing the initial task was 
Position and color 


Gagné et al. (7 
discriminations only were used dur- 
ing the initial task and in the final 
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phase both kinds of discrimination 
were involved. 


Response Variations 


Gibbs (10) used a hand-wheel con- 
trolled compensatory tracking task 
in which difficulty varied as a func- 
tion of hand-wheel diameter and in 
which the smaller hand-wheel re- 
sulted in a more difficult task. An- 
other method of varying task diff- 
culty used by Gibbs (10) was that of 
changing the complexity of the path 
to be followed in a following tracking 
task. It is conceivable, of course, 
hat a more complicated path pro- 
vided greater perceptual difficulties 
as well as demanding more complex 
A lever and a pressure 
control were also used to varv diffi- 
ficulty in another investigation by 
Gibbs (11) using a compensatory 
tracking task. The tracking task 
used by Baker et al. (2) was of the fol- 
lowing variety in which variations in 
gear ratios between hand-wheel con- 
trol and follower led to changes in 
the speed of movement of the fol- 
lower relative to hand-wheel turning 


resp mses, 


rate, 


e? 


Variations in Control-Display Linkage 


In the experiment by Barch (3) 
above, task difficulty was varied by 
means of a complete and a partial re- 
versal of control-display relationship 
used in the standard form of the task. 
\ change from a “natural” or “ex- 
pected” to an “unnatural” or “un- 
expected” relationship resulted in an 
increase in task difficulty in an experi- 
ment by Gibbs (10). A following 
tracking task (lowa Pursuit Appara- 
tus) in which difficulty was varied 
in four Ways using standard, reversed, 
and two partially reversed display- 
control linkages was emploved by 
Barch and Lewis (4). Under the 
condition of partial display-control 


linkage reversal either the left or 
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right hand linkage was reversed. It 
is interesting to note that results from 
this experiment are contrary to those 
obtained by Gibbs (10). Lincoln (13) 
varied task difficulty by using direct, 
velocity, and aided controls. 
RESULTS OF EXPERIMENTS 

At this point in the discussion it is 
as well to summarize briefly the gen- 
eral trend of the experimental results 
when task difficulty is varied in the 
numerous ways outlined in the previ- 
ous section. The effects of relative 
task difficulty on transfer when difh- 
culty is changed along a stit 
dimension are mainly negative when 
the simul In thre 
experiments (9, initial 
task difficulty pro 
the target dimensions did not give 
rise to a greater amount of transfer 
than when the initial task was rela- 
tively less difficult than the final 
task.! Target size, however, did exer- 
cise an eflect in a fourth experiment 
(3). In the case of this last eXDC ri- 
ment it should be pointed out that 
linkage variables, the ulty 
fects of which were not determined, 
could have affected the of 
transfer from initial to final task. 
Greater initial task li 
culty in terms of tar 
also failed to gi 
an easier task t 
order. Increasing 
changing from a follow 
pensatory tracking task (1) did result 
in greater transfer when the difficult 


nulus 


alone is varied. 
R?) 


luced by changing 


greater 


Clift ef- 


amount 


\ greater transier to 


lve 
| an did the opposite 
task difficulty by 


ing to a com- 


by Green 
this paper 
was varied over 
| 


1 The results from an experiment 
published after the completion of 
in which target si 
range are in agreement with these pr 
findings. The task used was a following 
ing task. (Green, R. F., Transfer of skill on a 
following tracking task as a function of task 
difficulty (target size) J. Psychol., 1955, 39, 
355-370.) 


a wide 
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task was practiced initially, but only 
when task had an “un- 
natural” control-display — arrange- 
ment. This 
when the tinal task control-display 
relationship was ‘‘natural”’ but still 
less dithicult than the first task. In an 


ask. 
increasing 


\ 
the second 


’ 
; effect was not observed 


aiming 
the difficulty of the 


ing the | 


experiment (23 


_ * ee - 
task by INncre 


aS- 


complexity of the display did 


fer in the diffi- 


- 14 . - ‘ 
result 1n gre. ( 
cult to easy irection. In 


this experi- 
ment, how the variations in the 
per ept 

res in the mode of responding. 


tribute 


perti 

r experi- 
nation task 
toa total task 
ore difticult 


crimination 


ist be be rne 


i this experi- 
the 
ial 
task 
the 

dealt 
transier trom one 


1 


ymparable wi 
which the 
varied. 


of transter trom 


a difficult to a less difficult condition 
than fron ] 1) ie Asal Ps ee | 
than from a less difficult to a difficult 


condition is met with more con 


sist- 
task difficulty is varied 
respect to variables. 
When the task was made more diff- 
cult by altering hand-wheel size and 


ently when 


“oy 
witha response 








course complenxi (10), met! 
control (11), and rate of responding 
(2), 


-ansfer resulted 
this form 


initially. 


when 


Two CX} rin 
dealt with the eff 
task difficulty varied 
the spatial relations 
trol and displ 
Whereas One 
(10) has 


nature ot 
trol 
possible rea 
between the 
tact ne 


alteri 


devices 


eree 
tasks. 

It is 
to the fa 
non u 
be il i! 
only a ] 
which are k 
culty level « 
with 
such 
friction, 
12 
and control 
dius (22), handedness (21), an 
planes of op have been 
systematically 
tion to the extent 
tors affect ease of control. 
of stimulus variables, target 
(15), visual magnificati 


(12), structure of target 


sions 
target 
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round (20), as well as others, have 
been examined 


tT rbjyle 


in relation to task dif- 


‘matic examination of 
nm the point of view 


effects in 
1¢ difficult and 
be ot consid- 
. 
as well as a 
an understanding of 


process in skilled per- 


Pask DIFFICULTY 
on and Control of Variables 
elative diffe ulty of 
is defined in terms 
le of mean. scores 
ng whole or part of the 

i or, in 

the score 

These scores 

! usually to statisti- 

order to establish 

‘of differences between 
these differences prove 

nt, then that form of 
reatest mean ac- 

time of per- 

is said to be 

In transfer studies 
reviewed here, it is of im- 
opera- 
of the 


“as possi- 


isk diffi- 
is an essential 
is to be constructed 
elfects on transfer 
difficulty. 
1¢ h has becon ec plain 
review is the dif- 
ing one task variable 
ale of difficulty without pro- 
unmeasurable changes in 
I Even 
though the relative difficulty of the 
two tasks may have become obvious 
from an examination of performance 


Ggucing 
ther closely related varia 


‘ 
( les. 
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scores, the exact manner in which 
task difficulty arose, or to which task 
variable task difficulty can be attrib- 
uted, has not always been clear. In 
short, in many of the experiments 
discussed the Jocus of task difficulty 
has not always been clearly specifi- 
able. It should be remembered, how- 
ever, that the stimulus or response 
conditions probably never remain the 
same from trial to trial, but, as Os- 
good (19) has been careful to show, 
stimulus and response may retain 
identical functions during learning. 
Even allowing for this fact many of 
the experiments on the effects of task 
difficulty on transfer have failed to 
control adequately the source of dif- 
ficulty. 

In the experiment by Gibbs (10) 
in which a following tracking or 
‘steering’? task was used not only 
course complexity, but also the com- 
plexity of the responses demanded by 
the course, varied under the two 
conditions. The more difficult of the 
two tasks presented the operator with 
a task which required responses of 
greater complexity than the easier 
task, as well as with a stimulus situa- 
tion which may well have been more 
difficult perceptually. It is by no 
means easy in this case to state defi- 
nitely the separate contributions of 
each of these factors to the difficulty 
of the task, or to the extent of trans- 
fer. A similar problem is met with in 
the investigation of Lincoln and 
Smith (14) where variations in target 
speed may well have given rise to 
greater perceptual difficulty, as well 
as greater difficulty in responding. 
In the experiment by Szasfran and 
Welford (23) this same problem arises 
again. The “bar’’ condition not only 
demanded an alteratien in the re- 
sponse by requiring the subject to 
throw over it, but changed as well the 
stimulus situation. It is not possible, 


DAY 


of course, to state whether or not the 
‘bar’ and ‘‘direct’’ conditions § re- 


mained functionally identical while 
at the same time it 1s equally impossi- 


ble to state definitely the locus of task 
difficulty. A similar argument applies 
when the greater difficulty of the 
condition in this investiga- 
The two task con- 
ditions emploved by Andreas ef al. 
(1) varied not only with respect to the 
number of display elements, but also 
in relation 
complexity and control-display rela- 
tionships. 
ble to state the individual contribu- 
tions to task difficulty nor their possi- 
ble interactions, all of which could 
have affected the level of difficulty. 

A further kind of problem in the 
control of variables has arisen when 
changes along one stimulus or re- 
have led un- 
measurable variations along a closely 
related dimension. For example, 
Baker et al. (2) altered the level of 
task difficulty by varying the rate of 
hand-wheel turning to 
move the target-follower through a 
certain distance. These authors have 
pointed out that changes in turning 
rate altered also the extent of move- 
ment as well as the required force of 
movement. Variations in task diffi- 
culty could be due to any one of these 


“Screen” 


tion is considered, 


to course and response 


In this case it is not possi- 


sponse dimension to 


necessary 


factors or to an interaction between 
two or all of these. 

The principal then, in 
many of to 
state the locus of variations in task 
difficulty. Without doing this trans- 
fer of training of skill as a function 
of the relative difficulty of the tasks 
to deal with theoretically, 
since the actual experimental evi- 
dence remains obscure. The probiem 
of isolation and control of “‘dithculty”’ 
variables in skilled tasks not 
lend itself easily to solution, since 


problem, 
the studies reviewed is 


is not eas 


does 








RELATIVE TASK DIFFICULTY 


stimulus and respouse factors are so 
intimately related, and theretore ex- 
ceptionaliy ditheult to isolate under 


experime tal conditions. 


Task Difficulty 


Sta nda r 1 s 


and Performance 
skilled tasks the 
level of performance required is fre- 
quently rather than de- 
manded by the task itself. This point 
can be 


In a number of 


im plied 


made clearer by illustration. 


if ina following tracking task the tar- 
get is 1 in. in diameter, the target fol- 
lower } in. in diameter, and the task 


of the operator is to superimpose the 


latter upon the former, a greater mar- 
if both 


gin of error is permitted than 
a : 
these display elements are } 


q il. IN 


diameter. The extent of permissible 
error is liuplied b: 


ol target al | 


the relative sizes 
In this 
ample, whereas holding the follower 
superimposed upon the small ({ in 
target may not be 


task than 


follower. ex- 


neces sarily a niore 


ditticult keeping it cn a 
larger target, the structure of the 
stimulus situation in the latter case 
1 wider margin 


the off-target 


is such that there ts 
extent 


This 


means that the two conditions men- 


} 
whi ii 


within 


does not count as an error. 


tioned differ with respect to opera- 
tionally detined * 


' 
condition demands 


ditficulty’’ insotatr 
as one | higher 


periorn ance standards rather than a 


higher level of skill. The “dil- 


ficult” 


more 


under these circum- 


Lask 


stances, then, is the one which re- 


; 
quires a higher stand: periorm- 

. . a . 
ance by sett narrower error-toler- 


ance limits. The two task conditions 


do not necessarily differ, or ditfer 
actual 
d tor 


adequate periormance. The ditfer- 


only to a small degree, in the 


level of skill whi n they de al 
ences, sometimes large, between per- 
formance curves for two task condi- 
tions differing in this manner may be 
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due primarily to the fact that the 
operator is directing to the task very 
little, or a great deal of effort, since 
the task implies by its target and 
follower dimensions a certain stand- 
ard of performance. The difficulty 
differences between the task condi- 
tions is more apparent than real. 
Szasiran and Welford (23) have 
suggested that one possible explana- 
tion of the phenomenon of greater 
transfer from a difficult to an easy 
task may be found in the higher 
standards of performance established 
during the difficult initial task, and 
carried over to the easier final task. 
Transfer in this case would be ex- 
pected to be greater than in shifting 
an easy initial task to a more 
dificult final task. The available ex- 
perimental evidence does little to sup- 
port this hypothesis. In the experi- 
ments reported (9, 17) dealing with 
ihe relative dimensions of target and 
follower in following tracking tasks in 
which the task difficulty was varied 
by changing error tolerance limits, 
transfer was positive and about equal 
in going from difficult to easy and 
to difficult conditions. In 
another experiment (3) greater trans- 
ter was found in shifting from difficult 
y, but it has been pointed out 
that control-display relationship fac- 
tors were confounded with stimulus 
factors in this study and the experi- 
mental design considered only the 
to difficult direction. 
lhe concept of error-tolerance im- 
plicit to the task as a determinant of 
performance is not a new one. Mace 
16) has put forward an hypothesis 
standards” which is 
summarized in the — statement 
. subjects aiming at targets de- 
fined to themselves a ‘good,’ ‘fair’ or 
‘poor’ shot not in terms of its absolute 
distance from the bull's-eye, but in a 
way which was relative to the form of 


LrOtil 


Irom @asy 


to easy, 


eas\ 


of “implicit 
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target employed” (16, p. 103). A simi- 
lar view has been expressed by Hel- 
son (12) in his hypothesis of par or 
tolerance. 

The present indications are that 
when the difficulty of a task varies 
only in terms of the implied extent of 
error tolerated, neither condition call- 
ing for a higher level of skill, the phe- 
nomenon of greater transfer from the 
difficult to the less difficult condition 
than from the less to the 
cult condition 
far this has only 
relation to the relative dimensions 


more Qiifl- 
does not occur. So 
been observed in 


target and follower in a tracking 


Much needs to be done with 
to botha variety of skilled ta 
the various feat skilled tas! 


before sions can 


ires ol 
Gefinite conch 
} . +} 1 . 
drawn or hypotheses clearly for 
lated. 
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Human ps 
as judged _ by 
fort, over a more 
for a given stil 
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ance is plotted in terms of 
rocal of accuracy, t 
roughly U-shaped (1 


Helson has 
curve for 
hand-wheel 


time-constants, 


1 a 
speed, and hand- 
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wheel size and inertia. 

Employing this notion of the opti- 
mal band of performance Gibbs (10 
has suggeste | that: 
between two tasks, one of 


. transfer L 
which lies within the tolerance limits, and one 
° : 


of which hes tside this 
be (a) positi I 
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EYSENCK’S TENDER-MINDEDNESS DIMENSION: 
A CRITIQUE 


MILTON ROKEACH ann CHARLES HANLEY 


Michigan State 


In a series of articles and books 
(1.2. 3, 4, 5,6 Evsenck has tried to 
istrate that differ- 


re duc ible 


den (8) 


individual 
ences in social attitudes are 
social attitudes.” 
The first of these he calls “‘radicalism- 


conservatis , and the 


to “two primary 


SC ¢ ond 
“tough-mindedness- tender- minded- 


ness.””' Our primary in this 
the empirical and theo- 
retical bases of the latter of these di- 
mensions. 

The origin of the two factors has 


described by I 


Concer! 


paper is with 


been Vs nek 


ed by the writer 
the 


from 


responses to a 40- 


ide by 
These 


for the 


750 middle 


were drawn 


three major 


itive, liberal, 


ties (COnSsery 


d socia t), and the 


ate 
250 subjects repre 


sentit yr ea h party were equ ited for age, Sex, 


nd edu tion It was shown that iter 


saturations on 


ns hav 
factor radicalism 
m (R) also distinguished at a high 
between voters for the 
tive and radical parties respectively, 
s having low saturations on R failed 

ish between the two parties 
factor (T) 
uncorrelated with R, and 
no discrimination between the political 
On analyzing responses of additonal 
amples of communist and fascist party mem- 
bers, however, it was found that both these 
groups were differentiated with considerable 
accuracy from members of the three demo- 
their low T both 
communists and fascists were thus found to be 


hncance 


Lhe 
found 


tender mindedness Was 


to be quite 
to vive 


partic $ 


cratic parties bv scores; 

tough-minded in comparison with conserva- 

tives, liberals, and socialists (3, p. 563). 
Evsenck describes the bipolar ten- 


der-mindedness—-tough-mindedness 
factor primarily in terms of the items 
1It should be understood that whenever 


this term is used, we follow strictly Eysenck’s 
own usage. 


University 
which measure it (1). The reader 
may refer to the left-hand column of 
Table 1 for the 14 T items and the di- 
rection in which they are to be scored. 
In attempting to find an underlying 
concept corresponding to this factor, 
Ieyvsenck uses William James's di- 
chotomy of tender-mindedness 
tough-mindedness. 


vs. 
Eysenck notes 
“one set of opinions appears to be 
dominated by ethical, moralistic, 
super-ego, altruistic values; the other 
by realistic, worldly, egotistic val- 
(hp: Ot). 

An examination of Eysenck’s writ- 
ings reveals that he explains, or re- 


ues” 


iiterprets, a great body of research 
on the organization of social attitudes 
iid their relation to personality in 
terms of the R and T factors. Thus, 
a considerable portion of his recent 
book, The Psychology of Politics (5), 
is devoted to just this undertaking. 

In view of the importance which 
Evsenck attaches to the factor he 
labels tender-mindedness, and the 
far-reaching conclusions he draws re- 
garding differences on this dimension 
among adherents to various political 
positions ranging from the Right to 
the Left, we felt that a careful scru- 
tiny of his data was in order. On the 
basis of our analysis, we have arrived 
at the opinion that the evidence 
Eysenck himself presents to support 
these conclusions contains grave con- 
tradictions and errors in computa- 
tion, and that the concept itself, at 
least as employed by Evysenck, con- 
tributes little or nothing to the scien- 
tific understanding of the social phe- 
nomena to which he addresses him- 
self. 


169 





170 


MILTON ROKEACH AND CHARLES HANLEY 


FABLE 1 


FREQUENCY OF RESPONSES BY CONSERVAILIVES, LIBERALS, SOCIALISTS, AND COMMUNISTS 
TO THE ITEMS ON THE TENDER-MINDEDNESS SCALE 


Item and direction of scoring 


Items onwhichcommunistsare mosttender-minded: 


1. Coloured people are innately inferior to 
white people. Cl 


3. Waris inherent in human nature 


5. Persons with serious hereditary defects 
and diseases should be compulsorily 
sterilized. (T—) 

8. In the interests of peace, we should give 
up part of our national sovereignty 


(T+) 


10. It is wrong that men should be per- 
mitted greater sexual freedom than 
women by society. (1+ 


Conscientious objectors are traitors to 
their country, and should be treated ac- 
cordingly. (T—) 


13. 


36. The death penalty and 


is barbaric, 
should be abolished *+) 


39, The Japanese are by nature a cruel peo- 
ple. (T—) 


dod 


Items on which communists are most tough-minded :} 


old-fashioned, 
n our behav- 


9. Sunday-observance is 
and should cease to gover 
iour. (T—) 

5. The laws against abortion should be 
abolished. (T 

6. Only by going back to religion can civi- 

lization hope to survive r+ 

I 

Divorce laws shor 

divorce easier 


iId be altered to make 


(t- 


It is right and proper that religious edu- 
cation in schools should be compulsory. 
(T+) 


. Men and women have the right to find 
out whether they are sexually suited be- 
fore marriage by companionate 
marriage). (]'— 


(e.g., 


Proportion of ‘tyes’ answers 
Class 
Cons Lib. Soc. Com 


Middle 
Working 


Middle 
Working 


Middle 
Working 


Middle 
Worki x 


Middle 
Working 


Middle 
Working 


Middle 
Working 


Middle 
Working 


Middle 
Working 


Middle 
Working 


Middle 
Working 


Middle 
Working 


Middle 
Working 


Middle 
Working 
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The basis for our opinion rests 
primarily on an examination of the 
data which Eysenck presents in two 
papers which torm the foundation of 
his work with the T factor. In the 
earlier of these publications (1), 
Evsenck reports the factorial analv- 
sis alluded to earlier, and in the sec- 
ond (2), he presents data regarding 
the performance of middle- and work- 
ing-class members of the conserva- 
tive, liberal, socialist, 
parties. 


and communist 
The mean values of T for 
key senck 
Table 2 under the 
heading “Original mean.’”* 


these groups, as given by 
(2), are shown in 


TABLE 2 
MEAN TENDER-MINDEDNESS SCORES 0 
MIEMBERS OF VARIOUS BRITISH 
POLITICAL PARTIES AND Socto- 
ECONOMIC CLASSES 


Party 


Middle 
Working 


Conserva 
tive 


Liberal 


Middle 
W orki 


Middle 
Working 


Middle 
Working 


There 
the responds to each item 
with a ++, +, 0, —, or —-, ac- 


are 14 items on the T 
subject 


s¢ ale: 


cording to the degree of his agreement 
with the content of 
If the subject responds to 
all items in the tender-minded direc- 
tion (e.g., — —, or —, to Item 1 tn 
our Table 1), he obtains a score of 14. 
If, on the other hand, he answers all 
14 items either in the tough-minded 


or disavreement 


the item. 


? Means for seven fascists were also pre- 
sented, unaccompanied by any breakdown of 
responses to individual items. Hence, we omit 
consideration of this small sample 


direction, or with ‘‘0,”” he gets a score 
of zero on the scale. From Table 2 
it is seen that, according to Eysenck, 
the middle- and working-class com- 
munists are the least tender-minded 
of the four political groups. He indi- 
cates further than this difference is 
statistically significant. 

It is reasonable to suppose, in view 
ot these means, that the communists 
would tend to show up as least ten- 
der-minded on each of the 14 items 
measuring the T dimension. — For- 
tunately, Eyvsenck has presented (2, 
p. 203) the percentage of agreement 
with each of the 14 T items, broken 
down socioeconomi 
and political groupings. On inspect- 
ing these data, we found that the 
communists were the most tough- 
minded of the groups on six of the 14 
items, but were the 


according to 


tender- 
all on the remaining eight 
These tindings, furthermore, 
both the middle- and 
working-class samples. In our Table 
have reproduced Evysenck’s 
data on this point, rearranging it into 
two sections: (a) 


most 
minded of 
ifems. 


obtained for 
1, we 


items on which com- 
the most tender- 
items on which com- 
munists were the most tough-minded. 


munists were 


minded, and (} 


It should come as no surprise to 
find that British communists are 
most tough-minded on six of the 14 
T items, if by tough-minded one 
means that they are opposed to Sun- 
day observance, favor the abolition 
of laws against abortion, are anti- 
religion, believe that divorce laws 
should be liberalized, are against 
compulsory religious education in the 
schools, and favor 


marriage. 


companionate 
Nor does it surprise us to 
find these same communists are most 
tender-minded on the remaining eight 
items, if by tender-minded one means 
that they reject the idea that colored 
people are inferior, disagree with the 
notion that war is inherent in human 
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nature, Oppose compulsory steriliza- 
tion, are willing to give up national 
sovereignty, reject the double stand- 
ard, oppose the idea that conscien- 
tious objectors are traitors, favor the 
abolition of the death penalty, and 
reject the proposal that the Japanese 
are inherently cruel. The conclusions 
we draw from these results, how- 
ever, are considerably at variance 
from those Eysenck arrives at on the 
basis of differences in means on T (cf. 
“Original mean” in Table 2). Fur- 
thermore, we note Eysenck in coming 
to his conclusions makes no reference 
whatever to these data on the indi- 
vidual items. This oversight is in 
sharp contrast to the fact that he 
does not hesitate to point to such 
group differences in response to indi- 
vidual items as supporting the va- 
lidity of the radicalism-conservatism 
factor (1, p. 60-61; see also our quota- 
tion from Eysenck at the beginning 
of this paper). 

It is seen then, that while middle- 
and working-class communists 
the most tender-minded of all 
groups on eight of the 14 items, 
Eysenck’s mean scores on T show 
them as significantly more tough- 
minded than conservatives, liberals, 
and socialists. It was difficult for us 
to reconcile such findings; therefore, 
we proceeded to recompute the 


are 
the 


means for the various samples. This 


is easily done using the information 
contained in Table 1 on the percent- 
age of agreement with each item. The 
simplest way of doing this is to add 
the percentage values of tender- 
minded responses for all 14 items for 
a given group. It is necessary, of 
course, to keep in mind the direction 
of the scoring. For items on which 
disagreement indicates tender-mind- 
edness, the percentage frequencies 
must be subtracted from 100 per cent 
in order to get the correct value to 
use in recomputing the means. Thus, 
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the mean for the middle-class, con- 
servative group is recomputed by 
summing the following tender-mind- 
edness proportions obtained trom the 
data in Table 1: (1, T—) .58; (3, T—) 
33; (5, T—) .31; (8, T+) .32; (10, 
T+) .66; (13, T—) .72; (36, T+) 
30; (39, T—) .42; (9, T—) .64; (15, 
T—) .72: (16, T+) .65; (23, T-—) 
.67; (28, T+) .66; and (29, T—) .65. 
Their sum rounds to 7.6, the mean ol 
the middle-class, conservative sam- 
ple. Thé same value may be obtained 
with a bit more labor by the following 
procedure: (a) Find the number of 
tender-minded responses to each item 
by multiplying each of the above pro- 
portions by 250, the number of Ss 
in the conservative group; ()) sum 
these values: (c) divide this sum by 
250. 

The values obtained trom our re- 
computation are shown in the ‘Re- 
computed mean” column of Table 2. 
A comparison of our 
Evsenck’'s reveals the following: 

1. In only two out of eight com- 
parisons are the means identical. 

2. Both middle- and working-class 
shift) markedly 
toward the tender-minded direction 
toward the values found for the 
the case of the 
middle-class communists, the shift is 
from 6.8 to 7.4; with the working- 
class communists the change 


means with 


communist means 


(i.e., 


other samples). In 


is evel 
more dramatic, from 6.0 to 7.3. 

3. The the 
groups in relation to tender-minded- 
ness is unchanged, but the differ- 
ence between communists and con- 
servatives is 7.4 Vs. 
7.6. 

4. With the working-class groups, 
the recomputed means indicate that 
the communists more 
minded than either conservatives or 
socialists, a drastic from 
Eysenck’s original finding. 

It is conceivable that the discrep- 


order ot middle-class 


indeed slight, 


are tender- 


change 





EYSENCK’'S TENDER- 


ancies appearing in Table 2 arise be- 
cause our method of scoring diverges 
in one respect from Eysenck’'s. He 
always scores ‘‘O” responses as tough- 
Table 1 
gives only frequency of agreement, 
are forced to treat ‘‘O” 
certain 


minded, while we, because 


answers to 
Item 1 and all 
others scored in the negative direc- 
tion) as tender-minded. We do not 
believe that this difference in scoring 


items 


7. 
(e.g., 


technique accounts for the discrep- 
ancies. first that the fre- 
quency of ‘O" responses is the same 
Then, the re- 
computed means should (a) be uni- 
formly higher than those Eysenck 
reports, and the magnitude of in- 
crease should be approximately the 
same for all samples; the 
order of the groups on the T dimen- 
Neither 
With respect 
to a, in the middle-class groups the 


Assume 


in all political groups. 


and (6 


sion should remain the same. 
of the preceding occurs. 


recomputed means for the conserva- 
tives and socialists show no change 
at all; further, the magnitude of in- 
crease in means is greatest in both 
middle- and working-class groups for 
With respect to h 
above, the order of the groups is strik- 
ingly altered in the working-class 
samples. Thus, it is clear that the dis- 
crepancies cannot be explained if we 
assume equal 


responses. 


the communists. 


frequencies ma Oe 


Another possibility comes to mind. 
Perhaps the communists show the 
greatest increases in recomputed 
means because they give relatively 
more ‘‘0"’ responses than do the other 
groups. This could account for the 
discrepancies. There is only indirect 
evidence bearing on this point, and it 
leads us to doubt that such is the 
case. Eysenck states that extreme 
responses to the items are more com- 
mon in his communist samples. Thus, 
the communists have ‘a greater 
tendency to believe strongly in the 


MINDEDNESS DIMENSION 173 


correctness of the attitude held” (5, 
p. 140). We must conclude, there- 
fore, that the discrepancies between 
our means and Eysenck’'s cannot be 
explained away by appealing to the 
manner in which we have had to 
handle “O" 


In view of the foregoing analysis, 


responses, 


Evsenck’s continued 
3, 4, 5) that 
tough-minded conservatives, 
liberals, and not 
ported by his published data. 


contention (2, 
communists are 
than 


more 


socialists, is sup- 


NATURE OF THE TENDER- 
MINDEDNESS ITEMS 


eee 


We do not consider it constructive 
to terminate analysis at this 
What conclusions can be 
validly drawn from these data? 

When one plots the factor satura- 
tions given by Eysenck (1) for the 
14 items, it is immediately apparent 
that a rotation of approximately 45 
degrees produces a striking increase 
in the number of items with near- 
zero saturations on one of the two 
rotated Inspection of the 
items with high positive or high nega- 
tive these rotated 
clearly that two 
kinds of content are involved which 


our 
point. 


factors. 


saturations on 
factors indicates 
are strikingly similar to Ferguson's 
factors of “‘religionism”’ and ‘“thuman- 
itarianism’’ (7) and Kirkpatrick's 
“religiositv’’ and “hu- 


(8).* 


dimensions of 
manitarianism”’ 


The positive pole of the “religion- 
factor is indicated by Items 16 
(going back to religion) and 28 (com- 
pulsory religious education), the neg- 
ative pole by Items 9 (Sunday observ- 
ance old-fashioned), 15 (abolish laws 


ism”’ 


(liberalize di- 
vorce laws), and 29 (companionate 


against abortion), 23 


§ The remainder of the items on Eysenck’'s 
questionnaire also were plotted in this way. 
The interpretation of the two rotated factors 
in no way needs to be altered to account for 


the content of these additional items. 





174 


marriage). It develops that the six 
T items on which communists are 
the most ‘‘tough-minded” are just 
these six “‘religionism" items. The 
communists take a clearly antireli- 
gious position. 

The positive pole of the ‘“‘humani- 
tarianigsm”™’ factor is established by 
Items 8 (give up 
eignty) and 36 (abolish death pen- 
alty), the negative pole by Items 1 
(colored people inferior), 3 (war in- 
herent in human nature), 13 (consci- 
entious objectors are traitors), and 
39 (Japanese cruel by nature). One 
additional item, 10 (oppose double 
standard), has a moderate, positive 
“humanitarianism” saturation, while 
Item 5 (compulsory sterilization) re- 
ceives a moderate, negative satura- 
tion. It turns out that these are the 
eight items on which the communists 
score as the most tender-minded of 
the various samples. 

These rotated factors make it far 
more understandable why the com- 
munists score the most tender- 
minded on eight of the 14 T items. 
All eight are saturated with ‘Shuman- 
itarianism.”’ It is generally known 
that communist supports 
the attitudes expressed by the posi- 


national sover- 


ideologs 


tive pole and opposes the attitudes 
expressed by the negative pole of this 


factor. Similarly, it is easily under- 
standable why the communists score 
as the most tough-minded on the re- 
maining six T items. All six are sat- 
urated with “religionism.”’ Again, 
it is consistent with communist ideol- 
ogy to agree with statements unfav- 
orable to religion and to disagree with 
statements based on religious doc- 
trine. 

When scores on the rotated factors 
are computed for the various groups, 
using the percentages given in Table 
1, the communists are highest on the 
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“humanitarianism”’ factor, followed 
in order by socialists, liberals, and 
lastly, by conservatives. Conversely, 
on the “religionism” factor, the 
conservatives score highest among 
middle-class samples, followed in 
descending order by liberals, social- 
ists, and finally, communists. The 
order in the working-class samples ts 
similar, except that 
higher on “religionism™ than do con- 
servatives. Furthermore, the order 
of the frequency of responses of the 
four middle-class political groups to 
all but two items corresponds exactly 


liberals score 


to the positions of these parties on 
the Right-Left political 
Table 1). A strong tendency of a sim- 
ilar sort is apparent in the working- 
class groups, but it must be remem- 


axis” (cf. 


bered that sampling errors are larger 
here because of the smaller number ot 
subjects involved. Thus, most, if not 
all, of the items on the tender-mind- 
edness scale are clearly related to 
political attiliation. 

The 


Evsenck’'s data in terms of ‘‘religion- 


preceding — reanalysis — ol 


and “humanitarianism”™’ meshes 
with the earlher research by 
Ferguson (7) as well as that by Kirk- 
patrick (8). Evsenck did not mention 
Ferguson's 1941 paper in his original 
publications on the R and T factors 
(1,2), but in The Structure of Iluman 
Personality (4) and in The Psychology 
of Politics (5) he raises the question 
as to whether 
tism”’ 


ism’ 
neatly 


‘“radicalism-conserva- 
“tough-tender-minded- 
superior to 
religionism”’ 


and 
Ferguson's 
“humanitarian- 
the 
Eysenck decides in tavor of his own 
dimensions. 


ness’ are 


and 
ism”? in accounting for data. 
It is instructive to follow 
his reasoning as it applies to the prob- 
lem. First, he argues that 
ism” and are 
to be preferred on the grounds of 


“‘radical- 
“tender-mindedness”’ 
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* More con- 
vincing,’’ Eysenck continues, ‘would 
be experimental evidence showing that 
Tough-mindedness had correlates in 
other fields, such as, for instance, in the 
field of personality, which neither 
Reltgionism IIlumanitarianism 


‘Al proof of this tvpe will be 


“semantic convenience.” 


nor 
possessed. 
ter, and the 
suspend his judg- 


(S, p. 147)? 


would indeed be in- 


attempted in a late 
asked to 


ment until then 


}, , 
Cha} 


rea der 


Such evidence 
We made a careful search 
of the remainder of The Ps 
Politics for 
mental test. 


structive. 
this experi- 
Our search was in vain. 
The issue is never again raised in the 
book. This treatment, of 
course, can hardly be | 


promised 


sort. ol 
considered an 
adequate resolution of the problem 
and serves only to reinforce the con- 
clusions we have reached 


SUMMARY 


This paper is an evaluation of 
Eysenc k's the 
he calls 


research on factor 
tough-mindedness-tender- 
Using a 14-item T scale 


such 


mindedness. 


with diverse religious 


as Sunday 


dealing 
observance, abor- 


issues 


tion, divorce, companionate -mar- 
and with such other social 
as race differences, the cruelty 


riage, €tc., 
issues 
of the Japanese, compulsory steriliza- 


tion, the double standard, and 


COn- 
scientious objectors, E:evsenck reports 
mean scores which indicate that mid- 
dle- and working-class British com- 
munists are more tough-minded than 
British 
conservatives, liberals, and socialists. 

Our Evsenck’s pub- 
lished contradicts his 
The major 


middle- and working-class 
analysis of 
data clearly 
findings and conclusions. 
points considered were: 

1. Contrary to Evsenck’s conten- 


* Italics ours. 


TENDER-MINDEDNESS DIMENSION 


wholoow of 
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tion that communists are the most 
tough-minded, we tind that on 8 of 14 
items both middle- and working-class 
communists were the most tender- 
minded of all the political groups. 
Eyvsenck overlooks these data in ar- 
riving at his conclusions. 

2. Our recomputation of the mean 
tender-mindedness scores, from data 
presented by Eysenck, strongly sug- 
gests serious errors in the original 
calculation of his means. 

3. The means and our 
analysis of Eysenck’s data on fre- 
individual 
items necessitate a serious modifica- 
tion of his conclusions regarding dif- 


corrected 


quency of agreement to 


ferences in tender-mindedness among 
the various political groups. 

4+. The 8 (out of 14) tender-mind- 
which the commu- 
nists were the most tender-minded of 
all parties turn out to involve content 


} 
ecainess items on 


which corresponds closely to Fergu- 
) and Kirkpatrick's (8) di- 
mensions of humanitarianism. 

5. The items on 
which the communists were the most 
tough-minded all pertain to a reli- 
gious dimension, also found by Fergu- 


* cd 
SsOn S (é 


remaining Six 


son and by WNirkpatrick. 

6. Consistent with common knowl- 
IE-ysenck’s conservatives score 
the highest on the religious items, 
followed in order 


edge, 
by liberals, social- 
ists, and communists. Communists 
score the highest on the humanitar- 
lanism items, followed in order by 
socialists, liberals, and conservatives. 

Our analvsis leads us to the con- 
that 


der-mindedness, as 


tough-mindedness-ten- 
conceived and 
measured by Evsenck, has no basis in 
fact. It is based on miscalculations 
and a disregard for a significant por- 
tion of his data. It conceals rather 
than the attitudinal differ- 
ences existing among political groups. 


clusion 


reveals 
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In the Psychology of Politics (11) 
and in a number of earlier articles and 
papers the writer has tried to do three 
things. In the first place, he has 
tried to construct a dimensional 
framework to deal with the interrela- 
tions obtaining between a wide va- 
riety of different social attitudes. The 
results of several experiments and 
analyses, carried out in different coun- 
tries and on different samples, led to 
the hypothesis that these relation- 
ships could be described with consid- 
erable accuracy in terms of two or- 
thogonal (independent) factors, la- 
beled radicalism-conservatism (R fac- 
tor) and tough-mindedness—tender- 
mindedness (T factor). No attempt 


was made, as Rokeach and Hanley 
(16) claim, “to demonstrate that in- 
dividual differences in social attitudes 
are reducible to ‘two primary social 


attitudes’’’; such a reduction would 
fail to take account of the specific 
part of the variance, which is con- 
siderable, and could not be effected 
by the use of the factorial method, 
on which our conclusions were based. 

In the second place, an attempt 
was made to follow up a hypothesis 
formulated quite early in the history 
of this research (5), to the effect that 
the IT dimension was correlated with 
certain personality variables, while 
no such correlation was postulated 
for the R dimension. The specific 
hypothesis tested was that intro- 
verted people would tend to be ten- 
der-minded, while extraverted people 
would tend to be tough-minded. In 
this connection, the hypothetical 
constructs “‘introversion’”’ and ‘‘ex- 
traversion”’ are used in terms of the 


operational definition given them in 
Dimensions of Personality (7), The 
Scientific Study of Personality (9), 
and The Structure of Human Per- 
sonality (10). 

In the third place, an attempt was 
made to link up both the attitude 
dimensions and the personality stud- 
ies with the main body of modern 
psychology by showing that the re- 
sults found in our experiments could 
be deduced from certain postulates 
of learning theory, and that in this 
way the particular structuring of 
variables observed could be explained 
by reference to a larger body of well- 
known facts. The claim is made in 
the Psychology of Politics (11) that 
these three aims have been accom- 
plished to a reasonable approxima- 
tion. In view of the fact that if this 
claim could be substantiated the work 
reported would be of some interest to 
social psychologists concerned with 
the integration of their field of study 
with that of general and experi- 
mental psychology, well-considered 
criticism possible weak- 
nesses in the chain of proof is wel- 
comed by the writer, as this would 
make possible the design of more con- 
vincing experiments, or lead to a 
more accurate restatement of the 
theory. It is to be regretted that the 
critique by Rokeach and Hanley 
(16) not seem to be related 
closely enough to the facts of the 
case to be useful from this point of 
view. 

Their first point of criticism ap- 
pears to be that in one paper (8) the 
writer concluded that the communist 
groups tested had low scores on ten- 
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showing 


does 
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der-mindedness; this they claim to be 
an error based on miscalculation. 
Computational errors do, of course, 
occur even when considerable care 
is taken. The writer does not believe 
that any such errors occurred in this 
case, for three reasons. In the first 
place, computations were done with 
all the usual checks, and were then 
repeated independently; identical re- 
sults were obtained the second time. 
This does not conclusively eliminate 
the possibility of computational er- 
rors, but makes their occurrence 
rather less likely. 

In the second place, the argument 
presented by Rokeach and Hanley in 
favor of their view is a very indirect 
one, as the published article does not 
contain enough detail to make ac- 
curate computation possible. As they 
themselves admit, in discussing the 
“0” responses, ‘‘there is only indirect 
evidence bearing on this point... .""! 
It is, in fact, impossible to argue back 
from the published figures in the way 
that Rokeach and Hanley are doing, 
and no rigorous development of their 
criticism is indeed attempted. When 
they say of their 
that “in only two out of eight com- 
parisons are the means identical,” it 
should be that 
this is quite irrelevant as their re- 
computations leave out part of the 
data. The fact that the means in two 
cases are identical is purely fortui- 
tous; there is no reason why any of 


“recomputations”’ 


clearly understood 


1 Even this “indirect evidence” of theirs is 
based on curious reasoning and factual inac- 
curacies. Thus Rokeach and Hanley say: ‘‘He 
{[Eysenck] alway “O" responses as 
tough-minded...."’ This is quite untrue. 
Several different scoring schemes have been 
tried out at various times, such as the one 
mentioned in the 1947 paper (6, p. 65). The 
work of Melvin (15) has contributed greatly 
to a final decision on the best method of deal- 
ing with the problem of the ‘‘0" response. 
Any recomputation based on false assump- 
tions of this kind must be regarded as irrele- 
vant. 


scores 
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the means should be identical. 

In the third place, Rokeach and 
Hanley have been very partial in 
their selection of evidence. They 
say that “in view of the foregoing 
analysis, Eysenck’s continued con- 
tention that communists are 
tough-minded than conservatives, 
liberals, and socialists, is not sup- 
ported by his published data.” Yet 
in the Psychology of Politics (11, p. 
141), there is given a detailed dia- 
gram of the scores made by com- 
munists and fascists, as compared 
with a group of matched subjects of 
conservative, liberal, and socialist 
attitudes; this diagram out 
completely the conclusion criticized 
by Rokeach and Hanley. The figures 
on which it is based, contained in a 
doctoral dissertation by Coulter (4) 
were available to at least one of the 
two critics, and the published dia- 
gram gives sufficient detail to show 
that this independent research sub- 
stantiates the contention that com- 
munists are more tough-minded than 
people supporting other political 
parties (with the exception of the 
fascists). The failure to mention this 
corroborative evidence is difficult to 
explain. 


more 


bears 


Equally important in this connec- 
tion is another research, completed 
only recently, and as yet unpublished 
This 


not 


study bv Nieniewitzkv could 


have 


been known to Rokeach 
and Hanley, but the results are very 
relevant to the question of whether 
the original data can be duplicated 
in repeated and independent studies. 
Basing his study on a properly strati- 
ned sample of the French population, 
and using a slightly modified and im- 
proved form of the T scale, Nignie- 
witzky found that communists had 
a mean score of 10.3; fascists had a 
mean score of 10.2; communist fel- 
low-travellers had a mean score of 
10.2. The mean score of supporters 








THE PSYCHOLOGY OF POLITICS: A REPLY 


of all the other main French parties 
was 17.6! ‘These figures are even 
more impressive than those found in 
England; they strongly support our 
view regarding the position of com- 
munists in the two-dimensional factor 
space. 

One further result from Nignet- 
witzky's study may be of interest. 
He found that in an analvsis of vari- 
ance carried out over the main politi- 
cal parties in France, the score on the 
T dimension gave an even better dif- 
ferentiation than did the R score (in 
Anglo-Saxon countries the opposite 
is usually found). Other scales, such 
as the F scale, which bears consider- 


scale,’ and 


able similarity to the T 
correlates reasonably highly with it 


itt most studies, were very much in- 
ferior to both the T 


These facts, added to those reported 


and the R scales. 


in the Psychology of Politics may serve 
as an adequate comment on Rokeach 
and Hanlev’s contention that ‘‘tough- 
mindedness-tender-mindedness, as 
conceived and measured by Eysenck, 
has no basis in tact. It 1s based on 
miscalculation and a disregard for a 
significant portion of his data. It con- 
ceals rather than reveals the attitudi- 
nal differences existing among politi- 
cal groups.”’ 

All in all, then, 


our answer to 


? Historically the T scale 
everal vears before the F scale 
sion was isolated in 1944 (5), 
published in 1947 (6). 
lished in 1950 (1), without mention of the T 
scale in spite of the obvious similarities. 
Neither was Ferguson’s (13) contribution 
mentioned, which also is very relevant to the 
concepts underlying the F Rokeach 
and Hanley take the author to task because 
he “did not mention Ferguson's 1941 paper 
in his original publications on the R and T 
factors."”. They omit to add that in an even 
earlier paper, not quoted by them at all, the 
writer (5) had thoroughly and in detail dis- 
cussed the contribution not onlv of Ferguson 
(12), but als of Carlson (13), Thurstone (17), 
and many others 


was published 

The T dimen- 
and the scale 
The F scale was pub- 


scale. 
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Rokeach and Hanley is that proper 
care was observed in the calculation 
of the data; that their criticism is not 
based on rigorous calculation, but on 
argument and surmise; and that two 
independent repetitions of the study, 
one of which was known to Rokeach 
and Hanley, give results even more 
striking in their support of our hy- 
pothesis than did the original study 
investigated by Rokeach and Hanley. 

Allied to the criticism regarding 
the alleged computational errors is 
Rokeach and Hanley's discussion of 
the detailed results of the 1951 paper. 
They take the writer to task because 
“in coming to his conclusions (he) 
makes no reference whatever to these 
data on the individual items.” This 
is the first time the writer has been 
criticized for obeving Rule 1.22, Sub- 
section d, of the APA Publication 
Manual (2), which “Data 
should be presented no more than 
Although it is appropriate to 
refer to tabular data in the text of an 
article, care should be taken not to 
repeal data unnecessarily in the sec- 


reads: 


once. 


tion on results, in the discussion, and 
in the summary.” The tabular pres- 
entation was suthciently detailed for 
Rokeach and Hanley to draw conclu- 
sions from it at considerable length; 
no editor would have permitted the 
writer a discussion of similar length 
in addition to the tabulation. How- 
ever, the main point of their discus- 
sion indicates that Rokeach and Han- 
ley fail to understand the chief char- 
acteristic of dimensional analysis. 
Communists as a group have loadings 
on two orthogonal factors; conse- 
quently their responses to individual 
items are determined not only by 
their tough-mindedness, but also by 
their radicalism. Items relating to 
anti-Semitism, war attitudes, the 
death penalty, and so forth should be 
answered in the affirmative because 
of their loading on tough-mindedness. 
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but in the negative because of their 
loading with radicalism; the outcome 
of the ensuing conflict will depend on 
the respective loadings, as well as on 
the exact position of each person in 
the communist group on the two fac- 
tors. The T score combines in equal 
proportions radical and conservative 
items and thus gets rid of the compli- 
cation introduced by the R factor; 
in just the same way the R score, 
combining in equal  porportions 
tough-minded and tender-minded 
items, gets rid of the complications 
introduced by the T factor. This 
point appeared too obvious and in- 
deed elementary to discuss at length 
in the paper; the reader interested in 
the detailed construction of the 
scales, and the problems encountered, 
may be referred to a separate publica- 
tion by Melvin (15). 

We may now turn to the second 
major criticism presented. In dis- 
cussing the similarity between his 
dimensional scheme and that pre- 
sented by Ferguson (12, 13), the 
writer (11, p. 147) has commented 
that a rotation of 45° would turn the 
one pair of reference axes (T and R) 
into the other (humanitarianism and 
religionism). There is an obvious 
semantic convenience in employing 


widely used and accepted terms, 


such as radicalism-conservatism, par- 
ticularly when there is evidence that 
the scale for measuring such a factor 
coincides with the actual major politi- 


cal party groupings (6). Further- 
more, it more reasonable to 
refer to communists as “tough- 
minded radicals," or to fascists as 
“tough-minded conservatives,’ than 
to refer to conservatives as ‘‘religious 
antihumanitarians,”’ or to socialists 
as “nonreligious humanitarians,”’ as 
we would have to do if we accepted 
the Ferguson scheme. Indeed, this 
rechristening seems to lead to a re- 


seems 
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ductio ad absurdum when we find 
Rokeach and Hanley arguing that 
“communists score the highest on the 
humanitarian items.’ To find com- 
munists considered as the most 
“humanitarian” group of all is cer- 
tainly a little startling! 

Hlowever, this argument regarding 
the superiority of the R and T dimen- 
sions on the basis of semantic con- 
venience was only used by the au- 
thor in a very subsidiary way. As 
pointed out in The Psychology of 
Politics, in a passage quoted by 
Rokeach and Hanley, ‘‘more 
vincing would be experimental evi- 
dence showing that Tough-minded- 
ness had correlates in other fields, 
such as, for instance, in the field ot 
personality, which neither Religion- 
ism nor Humanitarianism possessed. 
A proof of this type will be attempted 
in a later chapter..." (11, p. 147). 
Rokeach and Hanley comment: 
“Such evidence would indeed be in- 
structive. We made a careful search 
of the remainder of The Psychology 
of Politics for this promised experi- 
mental test. Our search was in vain. 
The issue is never again raised in the 
book.”’ The writer finds thiscomment 
difficult to understand. <A whole 
chapter, entitled Ideology and Tem- 
perament, is given over to a discus- 
sion of the experimental evidence re- 
lating to this problem, and _ several 
different approaches are reported, all 
of which support the hypothesis that 
tough-mindedness and extraversion 
are related to each other, as required 
by our hypothesis. The reader's at- 
tention is drawn particularly to Fig- 
ure 30, on p. 178 of The Psychology of 
Politics, which reports the results ob- 
tained by George (14) in a direct at- 
tack on this problem. It will be seen 
there that his measure of extraversion 
is situated almost exactly on the 
tough-minded factor axes. Anyone 


con- 
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familiar with dimensional analysis 
will be able to see for himself the re- 
sult of rotating the axes through an 
angle of 45°, thus bringing them in 
line with the Ferguson system. This 
would considerably reduce the cor- 
relation of extraversion from its pres- 
ent reasonably high size, and would 
leave us with two rather low and un- 
important correlations with religion- 
ism (negative) and with humanitar- 
ianism (negative). Iurthermore, the 
relation between extraversion 
tough-mindedness observed in 
study was predicted in 
theoretical 


and 
this 
terms of 
considerations: no such 
prediction was made to our knowl- 
edge with respect to Ferguson's two 
factors. Rokeach and Hanley’s fail- 
ure to see the relevance of this whole 
chapter, and of this study in particu- 
lar, to the point in question is diffi- 
cult to understand. 

They also fail to take into ¢ 
what 


ccount 


} 


to the writer is the most im- 


} 
t 
1 
t 


portant chapter in the whole boo 
viz., the concluding chapter entitled 
‘*\ Theory of Political Action.” 
an attempt has been made to deduce 
the actual structure of attitudes 
found, as well as the relationship of 
the T factor to extraversion-introver- 


hy 


Here 
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sion, from general learning theory; 
it was also deduced that there should 
be no relationship 
tween the R factor and the main per- 
sonality variables. None of the rela- 
tions pointed out in this chapter, 
and none of the deductions made, 
would be applicable to the Ferguson 
factors. Rokeach and Hanley do not 
ion this argument, although to 
the writer it the 
cogent one in coming to a decision be- 
This 
to grips with the 
as a Whole appears to 


consistent be- 


ment 
appears most 
the two rival schemes. 
failure to 
writer's theory 


tween 


Come 


him the outstanding weakness in the 
critique to which this is the reply. 
The authors have quite arbitrarily 
pir ked out certain isolated points, 
have disregarded the great 
supporting 


well 


mass ol 
evidence each 


] 


CONCLUSION, as 


separate 
as the intercon- 
nections betwee 


{ the research 


n the different parts 
( under review, and 
have come to conclusions which are 
not in fact borne out by a careful 
perusal of the evidence. The reader 
will be able to form his own opinion 
after comparing the facts as outlined 
in The Psychology of Politics with 
Rokeach and Hanley’s critique. 
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CARE AND CARELESSNESS IN 
CHARLES HANLEY ann MILTON 


ir 


Our joint reaction to Eysenck’'s 
reply (6) is that it evades or beclouds 
each of the specilic issues we raised 
‘“‘tender- 
To our mind, the very 
It is 
not Lhe Psychology of Politics (5) we 
have criticized. The reader whose at- 
tention is directed to this book will 


concerning his work on 
mindedness.” 


title of his reply is misleading. 


not be able to see for himself the seri- 
ous computational errors, the omis- 
and 
have 
Evsen k 
republishes in this book a good deal, 
but not all, of the data he presented 
Most con- 


Spi uous is the omission of his data 


sion of contradictory evidence, 


the 
pointed to in our critique.! 


erroneous conclusions we 


in earlier journal articles. 


on the responses of communist and 
working-class samples to the individ- 
ual items of his scale. He republishes 
such data only for middle-class con- 
liberals, 
It is precisely the omitted data which 
show 


scervalives, and socialists. 
that the communists are more 
‘“tender-minded” than conservatives, 
liberals, and socialists on 8 of the 14 


T itenis, and it 1s these same data that 
enabled us to 


mean 1] 
and 
thereby discover that the means he 
reports are incorrect. It 
that we felt it 
scrutinize Eysenck’s earlier and, by 
far, fuller reports (3, 4), rather than 
the data Ensenck has chosen to pre- 
sent in his book. 
Now let us examine Eysenck’s re- 


recompute 


scores for his various groups 
is for this 


reason necessary to 


1A reading of this book reveals shortcom- 
ings and errors other than those we have con- 
sidered. Many of these will be dealt with in a 
separate critique by Professor Richard 
Christie in a forthcoming issue of this journal 
(1). 


PSYCHOLOGY 
ROKEACH 


Wichigan State University 


ply to each of our criticisms. 
Concerning responses to individual 
T items. Eysenck states that by 
“dimensional analysis” it is possible 
that the com- 
munists, whom he describes as being 
the 
most tender-minded of any group on 
8 of the 14 T 


runs somewhat 


to reconcile the fact 
tough-minded, turn out to be 
items. His reasoning 
each of 
the 14 items not only measures indi- 
vidual differences along the tender- 
mindedness axis but also along the 
radicalism axis. 
ment with a specific item is scored 
both as tender-minded and as radical, 
and that the subject is a communist. 


as follows. 


Suppose that agree- 


Evsenck attributes such agreement 


to radicalism and not to tender- 


mindedness. He does not, however, 
consistently apply this line of thought 
to the other items. Thus, suppose 
that agreement with an item is scored 
both as tough-minded and _ radical, 
and the subject is a communist. If 
Ey send k were to follow his rule, such 
should be attributed to 
radicalism and not to tough-minded- 
This he does not do. Instead, 
he attributes such agreement to the 


agreement 
ness. 


operation of tough-mindedness. Sup- 
pose, again, that agreement is scored 
both as tender-minded and conserva- 
tive, and that the subject is a con- 
servative. If Eysenck were consist- 
ent, he should interpret such agree- 
ment as being due to conservatism, 
and not to tender-mindedness. This 
time, however, such agreement is at- 
tributed to both conservatism 
tender-mindedness. 


and 


We cannot escape the impression 
that following Eysenck’s line of argu- 
ment permits one to shift one’s ex- 
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planation of results from one axis to 
the other according to which axis one 
wishes to grind. 

Eysenck asserts also that the rea- 
son he never referred to the contra- 
dictions in the item response frequen- 
cies is that he was prevented from 
doing so because he was ‘‘obeving 
Rule 1.22, Subsection d, of the APA 
Publication Manual,”’ which is to the 
effect that data should not be re- 
peated unnecessarily. 

We find ourselves hesitant to take 
this explanation seriously. We will 
take space here only to point out that 
the APA Publication Manual 
Eysenck refers to was published in 
1952. The article containing the 
contradictory findings appeared in 
1951—in the British Journal of So- 
ciology! 

Concerning 


Eysenck’s incorrect 


means. When we recomputed means 
for Eysenck’s groups from the data 
he presents on response frequencies to 


items, we found serious discrepancies 
between his means and ours. We 
suggested that his means were in er- 
ror to such an extent that his con- 
clusions regarding differences in 
tough-mindedness among commu- 
nists, socialists, liberals and conserva- 
tives had no data. 
Eysenck replies ‘‘computations were 
done with all the usual checks, and 
were then repeated independently.” 
He denies the correctness of our re- 
computed means because, he says, it 
is “impossible’’ to do so using the 
response frequencies. These recom- 
putations, he continues, are based on 
‘false assumptions” and are “‘irrele- 
vant,”’ because we do not know how 
the ‘‘0’’ responses were scored. We 
are referred to the work of Melvin 
who “has contributed greatly to a fi- 
nal decision on the best method of 
dealing with the problem of the ‘0’ 
response.” 

Several comments are in order here. 

1. The most convincing way to 


basis in his 
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demonstrate the fallaciousness of a 
critic’s recomputations would be to 
make available the raw data and de- 
tailed instructions on how they are 
to be scored. Eysenck has made no 
effort to do this. 

2. Wedo not believe that Eysenck’s 
conclusions are worthy of serious at- 
tention if different ways of scoring 
“0’’ responses can make such whop- 
ping differences in findings. 

3. Eysenck himself describes on 
page 276 of The Psychology of Politics 
exactly how Melvin scores the T 
scale. As far as one can tell from 
Evsenck’s description (5, p. 65), his 
original scoring method is identical 
with Melvin’s. 

4. However the “0” responses may 
have been handled, the crucial data in 
Eysenck’s table of response frequen- 
cies to individual T items are already 
classified into proportions of ‘‘Yes” 
answers, and the accompanying text 
describes the scoring system in terms 
of ‘“‘Yes"’ answers to items (4, pp. 201, 
203). Thus, if his means fail to cor- 
respond with the means which any- 
one may calculate from the data pre- 
sented in this table, Eysenck’s means 
must be incorrect. 

5. We have noted previously the 
carelessness which Eysenck has shown 
in presenting factual material. Let us 
note two further examples from his 
reply. 

First, Eysenck gives in his reply 
the year 1955 as the date of Melvin's 
Ph.D. thesis. In The Psychology of 
Politics reference is also made to this 
thesis. On page 276 the date given is 
1954; and on page 301 the date is 
1953! (We note too that Eysenck 
refers to this unpublished dissertation 
as ‘‘a separate publication by Mel- 
vin.’’) 

Second, Eysenck writes in his re- 
ply: ‘‘Rokeach and Hanley take the 
author to task because he ‘did not 
mention Ferguson's 1941 paper in his 
original publications on the R and T 
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factors.. They omit to add that in 
an even earlier paper, not quoted by 
them at all, the writer (5) had thor- 
oughly and in detail discussed the 
contribution not only of Ferguson 
(12), but also of Carlson (13), Thur- 
stone (17), and many others.”’ 

A caretul 
statement 


consideration of this 
that ‘Ferguson's 
1941 paper” and the “Ferguson (12)” 
reference are not the same paper! The 
“Ferguson (12)” cited by Eysenck 
turns out to be another study by Fer- 
guson published in 1939.2 We suggest 
that re-reads the above 
quotation it will have an entirely dif- 
ferent meaning! 


shows 


if one now 


These two examples demonstrate 
marked carelessness by Eysenck in 
assembling factual 
material. Applicauion of some of the 
“usual checks,’’ which Eysenck states 
he employed in calculating his means, 
would also have prevented these er- 
rors from occuring. 


and presenting 


Concerning the independent repett- 
tions of Eysenck’s research. Eysenck 
states that two independent repeti- 
tions of his original study by Coulter 
and Nigniewitzky 
more striking in their support of our 
hypothesis.” 


“oive results even 


The internal consistency and com- 
putational correctness of a particular 
investigator's data cannot be estab- 
lished by referring to two or even 
more than two independent studies. 
On the basis of our 
Evsenck’s published 
have come to the conclusion that his 


evaluation of 
research, we 


data do not support his hypothesis. 
Whether the two unpublished studies 
of his students confirm or deny 
Eysenck’s hypothesis is an entirely 
separate issue. These studies deserve 

2 Involved here is more than a careless sub- 
stitution of one reference by another. Fer- 
guson’s 1941 paper describes a Religionism 
factor that Eysenck in 1944 (2) complained 
was missing from Ferguson’s 1939 study 
Eysenck’s 1944 paper makes no mention of 
Ferguson's 1941 paper. 
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evaluation 
merit. 


strictly on their own 
We look forward to the pub- 
lication of these reports.’ 

Concerning the alleged supertority 
of Eysenck’s factors over Ferguson's 
factors. \WWe pointed out that one may 
more understand Evsenck’'s 
Ferguson's 


easily 
referring to 
and “}umanitarian- 
ism’”’ factors rather than to Evsenck’s 
Rand T. We called attention to the 
fact that Eysenck promises in 7he 
Psy hology 0} Politics to produce “ex- 
perimental showing that 
Pough-mindedness had correlates in 
other helds, 


results by 


‘‘Religionism”’ 


evidenc e 


such as, for instance, in 
the tield of personality which neither 


Religionism nor Humanitarianism 


possess. 
We then pointed out that Eysenck 
The 


I-vsenck 


never again discusses this issue. 
matter is simply dropped! 
replies that he finds our ‘‘comment 
difficult to understand. A_ whole 
chapter, entitled ‘Ideology and Tem- 
perament,’ is given over to a discus- 
sion of the experimental evidence re- 
lating to this problem, and several 
ditferent approaches are reported, all 
of which support the hypothesis that 
tough-mindedness and extraversion 
are related to each other, as required 
by our hypothesis.” 

The only way possible to demon- 
strate the proof he promises of the 
superiority of his factors over Fergu- 
It is 
not sufficient to present only the cor- 
between T 
personality variables. 


son's is by a pilting procedure. 


relations and selected 
It is necessar\ 
the 


parable correlations between Fergu- 


to produce, in addition, com- 
son's factors and the same personality 
variables. Only then can one make a 
choice between the two alternative 
explanations. This is the test 
Eysenck promises but fails to make. 

There is no reference whatever in 

A detailed analysis of the deficiencies in 
Coulter's study, as reported by Eysenck, will 
be found in Christie's paper (1). 
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his chapter on Ideology and Temper- 
ament (or in his Figure 30, for that 
matter)* to the Ferguson factors or 
their correlates. Indeed, nowhere 
does he suggest that he has computed 
scores on these dimensions and cor- 
related them with the relevant per- 
sonality variables. We, therefore, 
must reaffirm our earlier statement: 
“Such evidence would indeed be 
instructive. We made a careful 
search of the remainder of The Psy- 
chology of Poliiics for this promised 
experimental iest. Our search 
in vain. The issue is never again 
raised in the book.” 

A final suggestion. If one is in- 
terested, for the sheer fun of it, in 


Was 


4Evsenck (6) asserts that anyone can see 
from his Figure 30 (5, p. 178) that rotating 45° 
to the Fergusonaxes reduces ‘the correlation of 
extraversion from its present reasonably high 
size, and would leave us with two rather low 
and unimportant correlations with religionism 
(negative) and humanitarianism (negative).” 
Following this ad /roc suggestion, we made the 
necessary rotation on Figure 30, measured 
the loading of extraversion on the new Relli- 
gionism axis, and found that it is .88 of the 
magnitude of extraversion’s loading on the 7 
axis. The text (5. p. 179 
relation between 


states that the cor- 
extraversion and. tender- 
mindedness is .41. Thus, the estimated cor- 
relation between extraversion and Religion- 
ism is 88 per cent of .41 (negative), or —.36. 
Readers may wonder why Eysenck should 
consider a correlation of .41 as ‘reasonably 
high,” but a correlation of “rather low 
and unimportant.” 


36 as 
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confirming Eysenck’s conclusion that 
fascists and communists are similar 
to each other and different from dem- 
ocratic subjects, we offer the follow- 
ing recipe. Construct a 20-item scale. 
Include 10 items referring to accept- 
ance of communist ideology (e.g., 
“Communism is the most desirable 
form of government,” etc.). Let the 


remaining 10 items refer to accept- 


““Fas- 
form of 
Give the ques- 
to communists, 


ance of fascist ideology (e.¢., 
cism is the desirable 
government,’ etc.). 
tionnaire 


most 


socialists, 
liberals, conservatives, and fascists. 
Factor items. [Emerge with two fac- 
Call “radicalism,” the 
other, ‘‘tough-mindedness."” — Score 
agreement with communist items and 
disagreement with fascist 
“radical.”’ Discover that communists 
are the most radical, fascists the most 
conservative, and democratic groups 
Score agreement with 
both communist and fascist items as 
“tough-minded.”’ Discover that com- 
muuists and fascists are both “tough- 
minded,” because they agree with 10 
of the 20 Find democratic 
groups to be “‘tender-minded,” be- 
cause they with the 
items. 

In our opinion, Eysenck is caught 
in precisely this sort of trap. We 
hope, as a result of this exchange, 
that others will be able to:steer clear 
of it. 


tors. one, 


items as 


in between. 


items. 


agree none of 
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